COMBINATORIAL POLYKETIDE LIBRARIES PRODUCED USING A
MODULAR PKS GENE CLUSTER AS SCAFFOLD

Reference to Government Funding 
This work was supported in part by a grant from the National Institutes of
Health, CA66736. The U.S. government has certain rights in this invention.

Technical Field 

The invention relates to the field of combinatorial libraries, to novel
polyketides and antibiotics and to methods to prepare them. More particularly, it
concerns construction of new polyketides and of libraries of polyketides synthesized
by polyketide synthases derived from a naturally occurring PKS, as illustrated by the
erythromycin gene cluster.

Background Art

Polyketides represent a large family of diverse compounds ultimately
synthesized from 2-carbon units through a series of Claisen-type condensations and
subsequent modifications. Members of this group include antibiotics such as
tetracyclines, anticancer agents such as daunomycin, and immunosuppressants such as
FK506 and rapamycin. Polyketides occur in many types of organisms including fungi
and mycelial bacteria, in particular, the actinomycetes.

A number of lactones of keto acids have been synthesized using standard
organic chemistry. These include a series of unsaturated ketolactones synthesized by
Vedejes et al. J Am Chem Soc (1987) 109:5437-5446, shown as formulas 201, 202
and 203 in Figure 11 herein. Additional compounds of formulas 204 and 205, also
shown in Figure 11, were synthesized as reported by Vedejes et al. J Am Chem Soc
(1989) 111:8430-8438. In addition, compounds 206-208 (Figure 11) were
synthesized by Borowitz (1975); compound 209 has been synthesized
by Ireland et al. J Org Chem (1980) 45:1868-1880.

The polyketides are synthesized in vivo by polyketide synthases (PKS). This
group of enzymatically active proteins is considered in a different category from the
fatty acid synthases which also catalyze condensation of 2-carbon units to result in,
for example, fatty acids and prostaglandins. Two major types of PKS are known
which are vastly different in their construction and mode of synthesis. These are
commonly referred to as Type I or "modular" and Type II, "aromatic."

The PKS scaffold that is the subject of the present invention is a member of
the group designated Type I or "modular" PKS. In this type, a set of separate active
sites exists for each step of carbon chain assembly and modification, but the
individual proteins contain a multiplicity of such separate active sites. There may be
only one multifunctional protein of this type, such as that required for the biosynthesis
of 6-methyl salicylic acid (Beck, J. et al. Eur J Biochem (1990) 192:487-498; Davis,
R. et al. Abstracts of Genetics of Industrial Microorganism Meeting, Montreal,
Abstract P288 (1994)). More commonly, and in bacterial-derived Type I PKS
assemblies, there are several such multifunctional proteins assembled to result in the
end product polyketide. (Cortes, J. et al. Nature (1990) 348:176; Donadio, S. et al.
Science (1991) 252:675; MacNeil, D.J. et al. Gene (1992) 115:119.)

A number of modular PKS genes have been cloned. U.S. Patent No.
5,252,474 describes cloning of genes encoding the synthase for avermectin; U.S.
Patent No. 5,098,837 describes the cloning of genes encoding the synthase for
spiramycin; European application 791,655 and European application 791,656 describe
the genes encoding the synthases for tylosin and platenolide respectively.

The PKS for erythromycin, used as an illustrative system, is a modular PKS.
Erythromycin was originally isolated from S. erythraeus (since reclassified as
Saccharopolyspora erythraea) which was found in a soil sample from the Philippine
archipelago. Cloning the genes was described by Donadio, S. et al. Science (1991)
252:675. The particulars have been reviewed by Perun, T.J. in Drug Action and Drug
Resistance in Bacteria, Vol. 1, S. Mitsuhashi (ed.) University Park Press, Baltimore,
1977. The antibiotic occurs in various glycosylated forms, designated A, B and C
during various stages of fermentation. The entire erythromycin biosynthetic gene
cluster from S. erythraeus has been mapped and sequenced by Donadio et al. in
Industrial Microorganisms: Basic and Applied Molecular Genetics (1993) R.H.
Baltz, G.D. Hegeman, and P.L. Skatrud (eds.) (Amer Soc Microbiol) and the entire
PKS is an assembly of three such multifunctional proteins usually designated
DEBS-1, DEBS-2, and DEBS-3, encoded by three separate genes.
Expression of the genes encoding the PKS complex may not be sufficient to
permit the production by the synthase enzymes of polyketides when the genes are
transformed into host cells that do not have the required auxiliary
phosphopantetheinyl transferase enzymes which posttranslationally modify the ACP
domains of the PKS. Genes encoding some of these transferases are described in
WO97/13845. In addition, enzymes that mediate glycosylation of the polyketides
synthesized are described in WO97/23630. U.S. Serial No. 08/989,332 filed
11 December 1997 describes the production of polyketides in hosts that normally do
not produce them by supplying appropriate phosphopantetheinyl transferase
expression systems. The contents of this application are incorporated herein by
reference.

There have been attempts to alter the polyketide synthase pathway of modular
PKS clusters. For example, European application 238,323 describes a process for
enhancing production of polyketides by introducing a rate-limiting synthase gene and
U.S. Patent No. 5,514,544 describes use of an activator protein for the synthase in
order to enhance production. U.S. Patent Nos. 4,874,748 and 5,149,639 describe
shuttle vectors that are useful in cloning modular PKS genes in general. Methods of
introducing an altered gene into a microorganism chromosome are described in
WO93/13663. Modification of the loading module for the DEBS-1 protein of the
erythromycin-producing polyketide synthase to substitute the loading module for the
avermectin-producing polyketide synthase in order to vary the starter unit was
described by Marsden, Andrew F.A. et al. Science (1998) 279:199-202 and Oliynyk,
M. et al. Chemistry and Biology (1996) 3:833-839. WO 98/01571, published
15 January 1998, describes manipulation of the erythromycin PKS and novel
polyketides resulting from such manipulation. In addition, WO 98/01 56, also
published 15 January 1998, describes a hybrid modular PKS gene for varying the
nature of the starter and extender units to synthesize novel polyketides.

In addition, U.S. Patent Nos. 5,063,155 and 5,168,052 describe preparation of
novel antibiotics using modular PKS systems. A number of modular PKS have been
cloned. See, e.g., U.S. Patent No. 5,098,837, EP 791,655, EP 791,656 and U.S. Patent
No. 5,252,474.
Type II PKS, in contrast to modular PKS, include several proteins, each of
which is simpler than those found in Type I polyketide synthases. The active sites in
these enzymes are used iteratively so that the proteins themselves are generally
monofunctional or bifunctional. For example, the aromatic PKS complexes derived
from Streptomyces have so far been found to contain three proteins encoded in three
open reading frames. One protein provides ketosynthase (KS) and acyltransferase
(AT) activities, a second provides a chain length determining factor (CLDF) and a
third is an acyl carrier protein (ACP).

The present invention is concerned with PKS systems derived from modular
PKS gene clusters. The nature of these clusters and their manipulation are further
described below.

Disclosure of the Invention 

The invention provides recombinant materials for the production of
combinatorial libraries of polyketides wherein the polyketide members of the library
are synthesized by various PKS systems derived from naturally occurring PKS
systems by using these systems as scaffolds. Generally, many members of these
libraries may themselves be novel compounds, and the invention further includes
novel polyketide members of these libraries. The invention methods may thus be
directed to the preparation of an individual polyketide. The polyketide may or may
not be novel, but the method of preparation permits a more convenient method of
preparing it. The resulting polyketides may be further modified to convert them to
antibiotics, typically through glycosylation. The invention also includes methods to
recover novel polyketides with desired binding activities by screening the libraries of
the invention.

Thus, in one aspect, the invention is directed to a method to prepare a nucleic
acid which contains a nucleotide sequence encoding a modified polyketide synthase.
The method comprises using a naturally occurring PKS encoding sequence as a
scaffold and modifying the portions of the nucleotide sequence that encode enzymatic
activities, either by mutagenesis, inactivation, or replacement. The thus modified
nucleotide sequence encoding a PKS can then be used to modify a suitable host cell
and the cell thus modified employed to produce a polyketide different from that
produced by the PKS whose scaffolding has been used to support modifications of
enzymatic activity. The invention is also directed to polyketides thus produced and
the antibiotics to which they may then be converted.

In another aspect, the invention is directed to a multiplicity of cell colonies 
comprising a library of colonies wherein each colony of the library contains an 
expression vector for the production of a different modular PKS, but derived from a 
naturally occurring PKS. In a preferred embodiment, the different PKS are derived 
from the erythromycin PKS. In any case, the library of different modular PKS is 
obtained by modifying one or more of the regions of a naturally occurring gene or 
gene cluster encoding an enzymatic activity so as to alter that activity, leaving intact 
the scaffold portions of the naturally occurring gene. If desired, more than one 
scaffold source may be used, but basing the cluster of modules on a single scaffold is 
preferred. In another aspect, the invention is directed to a multiplicity of cell colonies 
comprising a library of colonies wherein each colony of the library contains a
different modular PKS derived from a naturally occurring PKS, preferably the 
erythromycin PKS. The invention is also directed to methods to produce libraries of 
PKS complexes and to produce libraries of polyketides by culturing these colonies, as 
well as to the libraries so produced. In addition, the invention is directed to methods 
to screen the resulting polyketide libraries and to novel polyketides contained therein. 

Brief Description of the Drawings 

Figure 1A is a diagram of the erythromycin PKS complex from S. erythraeus
showing the function of each multifunctional protein, and also shows the structure of
6dEB and of D-desosamine and L-cladinose.

Figure 1B shows a diagram of the post-PKS biosynthesis of erythromycins
A-D.

Figure 2 is a diagram of DEBS-1 from S. erythraeus showing the functional
regions separated by linker regions.

Figure 3 shows a diagram of a vector containing the entire erythromycin gene
cluster.

Figure 4 shows a method for the construction of the vector of Figure 3.

Figure 5 shows a diagram of the erythromycin gene cluster with locations of
restriction sites introduced for ease of manipulation.

Figures 6A-6H show the structures of polyketides produced by manipulating
the erythromycin PKS gene cluster.

Figures 7A, 7B and 7C show the construction of derivative PKS gene clusters
from the vector of Figure 3.

Figure 8 shows antibiotics obtained from the polyketides of Figure 6A-6F.

Figure 9 shows the preparation of a polyketide containing an unsaturated
starter moiety and the corresponding antibiotic.

Figure 10 shows the preparation of a reagent used to glycosylate polyketides
to prepare the D-desosamine derivatives with antibiotic activity.

Figure 11 shows the structures of known, previously produced, 12-member
macrolides.

Figures 12A and 12B show the structures of known and previously produced
14-member macrolides.

Modes of Carrying Out the Invention 

It may be helpful to review the nature of the erythromycin PKS complex and
the gene cluster that encodes it as a model for modular PKS in general.

Figure 1A is a diagrammatic representation of the gene cluster encoding the
synthase for the polyketide backbone of the antibiotic erythromycin. The
erythromycin PKS protein assembly contains three high-molecular-weight proteins
(>200 kD) designated DEBS-1, DEBS-2 and DEBS-3, each encoded by a separate
gene (Caffrey et al., FEBS Lett (1992) 304:225). The diagram in Figure 1A shows
that each of the three proteins contains two modules of the synthase - a module being
that subset of reactivities required to provide an additional 2-carbon unit to the
molecule. As shown in Figure 1A, modules 1 and 2 reside on DEBS-1; modules 3
and 4 on DEBS-2 and modules 5 and 6 on DEBS-3. The minimal module is typified
in module 3 which contains a ketosynthase (KS), an acyltransferase (AT) and an acyl
carrier protein (ACP). These three functions are sufficient to activate an extender unit
and attach it to the remainder of the growing molecule. Additional activities that may
be included in a module relate to reactions other than the Claisen condensation, and
include a dehydratase activity (DH), an enoylreductase activity (ER) and a
ketoreductase activity (KR). The first module also contains repeats of the AT and
ACP activities because it catalyzes the initial condensation, i.e. it begins with a
"loading domain" represented by AT and ACP, which determine the nature of the
starter unit. Although not shown, module 3 has a KR region which has been
inactivated by mutation. The "finishing" of the molecule is regulated by the
thioesterase activity (TE) in module 6. This thioesterase appears to catalyze
cyclization of the macrolide ring thereby increasing the yield of the polyketide
product.
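
The module organization just described can be written down as a small data structure. The following is a minimal sketch only: the placement of modules on DEBS-1, DEBS-2 and DEBS-3, the minimal KS/AT/ACP set, the loading domain and the TE in module 6 come from the text above, whereas the domain lists for modules 1, 2, 4, 5 and 6 are an assumption taken from the commonly published DEBS architecture, and all identifiers (Module, DEBS, is_functional, beta_carbon_state) are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    number: int
    protein: str
    domains: tuple  # active catalytic domains of the module


# Loading domain that precedes module 1 and selects the starter unit.
LOADING_DOMAIN = ("AT", "ACP")

# ASSUMPTION: domain lists for modules 1, 2, 4, 5 and 6 follow the commonly
# published DEBS architecture; only module 3 is spelled out in the text.
DEBS = (
    Module(1, "DEBS-1", ("KS", "AT", "KR", "ACP")),
    Module(2, "DEBS-1", ("KS", "AT", "KR", "ACP")),
    Module(3, "DEBS-2", ("KS", "AT", "ACP")),  # its KR is inactive, so omitted
    Module(4, "DEBS-2", ("KS", "AT", "DH", "ER", "KR", "ACP")),
    Module(5, "DEBS-3", ("KS", "AT", "KR", "ACP")),
    Module(6, "DEBS-3", ("KS", "AT", "KR", "ACP", "TE")),
)

MINIMAL_MODULE = {"KS", "AT", "ACP"}


def is_functional(module):
    """A module can extend the chain only if it has KS, AT and ACP."""
    return MINIMAL_MODULE <= set(module.domains)


def beta_carbon_state(module):
    """Predict the functional group left at the beta-carbon by this module."""
    present = set(module.domains)
    if "KR" not in present:
        return "ketone"
    if "DH" not in present:
        return "hydroxyl"
    if "ER" not in present:
        return "double bond"
    return "methylene"
```

Under these assumptions, calling beta_carbon_state on each entry of DEBS gives a quick per-module view of how the optional KR, DH and ER activities shape the backbone.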

The product in this case is 6dEB; the structure and numbering system for this
molecule are shown in Figure 1A. Conversion to the antibiotics erythromycin A, B, C
and D would require glycosylation generally by D-desosamine or L-mycarose, which
may ultimately be converted to cladinose at appropriate locations. Figure 1B
diagrams the post-PKS biosynthesis of the erythromycins through addition of glycosyl
groups.

As shown, 6dEB is converted by the product of the gene eryF to erythronolide B
which is, in turn, glycosylated by eryB to obtain 3-O-mycarosylerythronolide B which
contains L-mycarose at position 3. The enzyme eryC then converts this compound to
erythromycin D by glycosylation with D-desosamine at position 5. Erythromycin D,
therefore, differs from 6dEB through glycosylation and by the addition of a hydroxyl
group at position 6. Erythromycin D can be converted to erythromycin B in a reaction
catalyzed by eryG by methylating the L-mycarose residue at position 3.
Erythromycin D is converted to erythromycin C by the addition of a hydroxyl group
at position 12. Erythromycin A is obtained from erythromycin C by methylation of
the mycarose residue catalyzed by eryG. The series of erythromycin antibiotics, then,
differs by the level of hydroxylation of the polyketide framework and by the
methylation status of the glycosyl residues.
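
The post-PKS conversions described above form a small directed graph, which can be sketched as follows. This is an illustrative model only: the compound names and step labels are taken from the preceding paragraph, while the identifiers STEPS and route are hypothetical, and the step from erythromycin D to erythromycin C is left without a gene name because none is given in the text.

```python
from collections import deque

# (substrate, product) -> step as described in the text
STEPS = {
    ("6dEB", "erythronolide B"): "eryF (hydroxyl at position 6)",
    ("erythronolide B", "3-O-mycarosylerythronolide B"):
        "eryB (L-mycarose at position 3)",
    ("3-O-mycarosylerythronolide B", "erythromycin D"):
        "eryC (D-desosamine at position 5)",
    ("erythromycin D", "erythromycin B"): "eryG (methylation of mycarose)",
    ("erythromycin D", "erythromycin C"): "hydroxylation at position 12",
    ("erythromycin C", "erythromycin A"): "eryG (methylation of mycarose)",
}


def route(start, goal):
    """Breadth-first search for the shortest series of conversions."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for substrate, product in STEPS:
            if substrate == path[-1] and product not in seen:
                seen.add(product)
                queue.append(path + [product])
    return None  # goal is not reachable from start


if __name__ == "__main__":
    path = route("6dEB", "erythromycin A")
    for substrate, product in zip(path, path[1:]):
        print(f"{substrate} -> {product}: {STEPS[(substrate, product)]}")
```

A query such as route("erythromycin B", "erythromycin A") returns None in this model, reflecting that the text describes erythromycin A as arising from erythromycin C rather than from erythromycin B.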

Figure 2 shows a detailed view of the regions in the first two modules which
comprise the first open reading frame encoding DEBS-1. The regions that encode
enzymatic activities are separated by linker or "scaffold"-encoding regions. These
scaffold regions encode amino acid sequences that space the enzymatic activities at
the appropriate distances and in the correct order. Thus, these linker regions
collectively can be considered to encode a scaffold into which the various activities
are placed in a particular order and spatial arrangement. This organization is similar
in the remaining genes, as well as in other naturally occurring modular PKS gene
clusters.

The three DEBS-1, 2 and 3 proteins are encoded by the genetic segments
ery-AI, ery-AII and ery-AIII, respectively. These reading frames are located on the
bacterial chromosome starting at about 10 kb distant from the erythromycin resistance
gene (ermE or eryR).

The detailed description above referring to erythromycin is typical for modular
PKS in general. Thus, rather than the illustrated erythromycin, the polyketide
synthases making up the libraries of the invention can be derived from the synthases
of other modular PKS, such as those which result in the production of rapamycin,
avermectin, FK-506, FR-008, monensin, rifamycin, soraphen-A, spinocyn,
squalestatin, or tylosin, and the like.

Regardless of the naturally occurring PKS gene used as a scaffold, the
invention provides libraries or individual modified forms, ultimately of polyketides,
by generating modifications in the erythromycin PKS or other naturally occurring
PKS gene cluster so that the protein complexes produced by the cluster have altered
activities in one or more respects, and thus produce polyketides other than the natural
product of the PKS. Novel polyketides may thus be prepared, or polyketides in
general prepared more readily, using this method. By providing a large number of
different genes or gene clusters derived from a naturally occurring PKS gene cluster,
each of which has been modified in a different way from the native cluster, an
effectively combinatorial library of polyketides can be produced as a result of the
multiple variations in these activities. All of the PKS encoding sequences used in the
present invention represent modular polyketide synthases "derived from" a naturally
occurring PKS, illustrated by the erythromycin PKS. As will be further described
below, the metes and bounds of this derivation can be described on both the protein
level and the encoding nucleotide sequence level.

By a modular PKS "derived from" the erythromycin or other naturally
occurring PKS is meant a modular polyketide synthase (or its corresponding encoding
gene(s)) that retains the scaffolding of all of the utilized portion of the naturally
occurring gene. (Not all modules need be included in the constructs.) On the
constant scaffold, at least one enzymatic activity is mutated, deleted or replaced, so as
to alter the activity. Alteration results when these activities are deleted or are replaced
by a different version of the activity, or simply mutated in such a way that a
polyketide other than the natural product results from these collective activities. This
occurs because there has been a resulting alteration of the starter unit and/or extender
unit, and/or stereochemistry, and/or chain length or cyclization and/or reductive or
dehydration cycle outcome at a corresponding position in the product polyketide.
Where a deleted activity is replaced, the origin of the replacement activity may come
from a corresponding activity in a different naturally occurring polyketide synthase or
from a different region of the same PKS. In the case of erythromycin, for example,
any or all of the DEBS-1, DEBS-2 and DEBS-3 proteins may be included in the
derivative or portions of any of these may be included; but the scaffolding of an
erythromycin PKS protein is retained in whatever derivative is considered. Similar
comments pertain to the corresponding ery-AI, ery-AII and ery-AIII genes.

The derivative may contain preferably at least a thioesterase activity from the 
erythromycin or other naturally occurring PKS gene cluster. 

In summary, a polyketide synthase "derived from" a naturally occurring PKS
contains the scaffolding encoded by all or the portion employed of the naturally
occurring synthase gene, contains at least two modules that are functional, preferably
three modules, and more preferably four or more modules and contains mutations,
deletions, or replacements of one or more of the activities of these functional modules
so that the nature of the resulting polyketide is altered. This definition applies both at
the protein and genetic levels. Particular preferred embodiments include those
wherein a KS, AT, KR, DH or ER has been deleted or replaced by a version of the
activity from a different PKS or from another location within the same PKS. Also
preferred are derivatives where at least one noncondensation cycle enzymatic activity
(KR, DH or ER) has been deleted or wherein any of these activities has been mutated
so as to change the ultimate polyketide synthesized.

Thus, there are five degrees of freedom for constructing a polyketide synthase
in terms of the polyketide that will be produced. First, the polyketide chain length
will be determined by the number of modules in the PKS. Second, the nature of the
carbon skeleton of the PKS will be determined by the specificities of the acyl
transferases which determine the nature of the extender units at each position - e.g.,
malonyl, methyl malonyl, or ethyl malonyl, etc. Third, the loading domain specificity
will also have an effect on the resulting carbon skeleton of the polyketide. Thus, the
loading domain may use a different starter unit, such as acetyl, propionyl, and the like.
Fourth, the oxidation state at various positions of the polyketide will be determined by
the dehydratase and reductase portions of the modules. This will determine the
presence and location of ketone, alcohol, double bonds or single bonds in the
polyketide. Finally, the stereochemistry of the resulting polyketide is a function of
three aspects of the synthase. The first aspect is related to the AT/KS specificity
associated with substituted malonyls as extender units, which affects stereochemistry
only when the reductive cycle is missing or when it contains only a ketoreductase
since the dehydratase would abolish chirality. Second, the specificity of the
ketoreductase will determine the chirality of any β-OH. Finally, the enoyl reductase
specificity for substituted malonyls as extender units will influence the result when
there is a complete KR/DH/ER available.
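
The first four degrees of freedom listed above multiply together, which is what makes the library combinatorial. The sketch below counts and enumerates designs under simplifying assumptions: the starter and extender choices are the examples named in the text, each module independently takes one of the four oxidation-state outcomes, stereochemistry is ignored, and every combination is counted whether or not the resulting synthase would actually be functional. The names STARTERS, EXTENDERS, REDUCTIVE_OUTCOMES, enumerate_designs and library_size are hypothetical.

```python
from itertools import product

STARTERS = ("acetyl", "propionyl")
EXTENDERS = ("malonyl", "methylmalonyl", "ethylmalonyl")
REDUCTIVE_OUTCOMES = ("ketone", "alcohol", "double bond", "single bond")


def enumerate_designs(n_modules):
    """Yield (starter, ((extender, outcome), ...)) for every design."""
    per_module = list(product(EXTENDERS, REDUCTIVE_OUTCOMES))
    for starter in STARTERS:
        for modules in product(per_module, repeat=n_modules):
            yield starter, modules


def library_size(n_modules):
    """Closed-form count matching enumerate_designs."""
    choices_per_module = len(EXTENDERS) * len(REDUCTIVE_OUTCOMES)
    return len(STARTERS) * choices_per_module ** n_modules


if __name__ == "__main__":
    for n in range(2, 7):
        print(n, "modules:", library_size(n), "nominal designs")
```

With these illustrative choices a six-module scaffold already gives 2 x 12^6 = 5,971,968 nominal designs, an upper bound on designs rather than a prediction of how many would yield product.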

Among the foregoing variables, for varying the loading domain specificity
which controls the starter unit, a useful approach is to modify the
KS activity in module 1 which results in the ability to incorporate alternative starter
units as well as module 1 extender units. This approach was illustrated in PCT
application US/96/11317 wherein the KS-1 activity was inactivated through mutation.
Polyketide synthesis is then initiated by feeding chemically synthesized analogs of
module 1 diketide products. Working examples of this aspect are also presented
hereinbelow.

Thus, the modular PKS systems, and in particular, the erythromycin PKS
system, permit a wide range of polyketides to be synthesized. As compared to the
aromatic PKS systems, a wider range of starter units including aliphatic monomers
(acetyl, propionyl, butyryl, isovaleryl, etc.), aromatics (aminohydroxybenzoyl),
alicyclics (cyclohexanoyl), and heterocyclics (thiazolyl) are found in various
macrocyclic polyketides. Recent studies have shown that modular PKSs have relaxed
specificity for their starter units (Kao et al. Science (1994), supra). Modular PKSs
also exhibit considerable variety with regard to the choice of extender units in each
condensation cycle. The degree of β-ketoreduction following a condensation reaction
has also been shown to be altered by genetic manipulation (Donadio et al. Science
(1991) supra; Donadio, S. et al. Proc Natl Acad Sci USA (1993) 90:7119-7123).
Likewise, the size of the polyketide product can be varied by designing mutants with
the appropriate number of modules (Kao, C. M. et al. J Am Chem Soc (1994)
116:11612-11613). Lastly, these enzymes are particularly well-known for generating
an impressive range of asymmetric centers in their products in a highly controlled
manner. The polyketides and antibiotics produced by the methods of the present
invention are typically single stereoisomeric forms. Although the compounds of the
invention can occur as mixtures of stereoisomers, it is more practical to generate
individual stereoisomers using this system. Thus, the combinatorial potential within
modular PKS pathways based on any naturally occurring modular PKS scaffold, such
as the erythromycin PKS scaffold, is virtually unlimited.

In general, the polyketide products of the PKS must be further modified,
typically by glycosylation, in order to exhibit antibiotic activity. Methods for
glycosylating the polyketides are generally known in the art; the glycosylation may be
effected intracellularly by providing the appropriate glycosylation enzymes or may be
effected in vitro using chemical synthetic means.

The antibiotic modular polyketides may contain any of a number of different
sugars, although D-desosamine, or a close analog thereof, is most common.
Erythromycin, picromycin, narbomycin and methymycin contain desosamine.
Erythromycin also contains L-cladinose (3-O-methyl mycarose). Tylosin contains
mycaminose (4-hydroxy desosamine), mycarose and 6-deoxy-D-allose.
2-Acetyl-1-bromodesosamine has been used as a donor to glycosylate polyketides by
Masamune et al. J Am Chem Soc (1975) 97:3512, 3513. Other, apparently more stable,
donors include glycosyl fluorides, thioglycosides, and trichloroacetimidates; Woodward,
R.B. et al. J Am Chem Soc (1981) 103:3215; Martin, S.F. et al. J Am Chem Soc (1997)
119:3193; Toshima, K. et al. J Am Chem Soc (1995) 117:3717; Matsumoto, T. et al.
Tetrahedron Lett (1988) 29:3575. Glycosylation can also be effected using the
macrolides as starting materials and using mutants of S. erythraea that are unable to
synthesize the macrolides to make the conversion. A method is illustrated in the
Examples hereinbelow.
Methods to Construct Multiple Modular PKS Derived from a Naturally Occurring
PKS

The derivatives of a naturally occurring PKS can be prepared by
manipulation of the relevant genes. A large number of modular PKS gene clusters
have been mapped and/or sequenced, including erythromycin, soraphen A, rifamycin,
and rapamycin, which have been completely mapped and sequenced, and FK506 and
oleandomycin which have been partially sequenced, and candicidin, avermectin, and
nemadectin which have been mapped and partially sequenced. Additional modular
PKS gene clusters are expected to be available as time progresses. These genes can
be manipulated using standard techniques to delete or inactivate activity encoding
regions, insert regions of genes encoding corresponding activities from the same or
different PKS system, or can be otherwise mutated using standard procedures for
obtaining genetic alterations. Of course, portions of, or all of, the desired derivative
coding sequences can be synthesized using standard solid phase synthesis methods such
as those described by Jaye et al. J Biol Chem (1984) 259:6331 and which are available
commercially from, for example, Applied Biosystems, Inc.

In order to obtain nucleotide sequences encoding a variety of derivatives of the
naturally occurring PKS, and thus a variety of polyketides for construction of a
library, a desired number of constructs can be obtained by "mixing and matching"
enzymatic activity-encoding portions, and mutations can be introduced into the native
host PKS gene cluster or portions thereof.

Mutations can be made to the native sequences using conventional techniques.
The substrates for mutation can be an entire cluster of genes or only one or two of
them; the substrate for mutation may also be portions of one or more of these genes.
Techniques for mutation include preparing synthetic oligonucleotides including the
mutations and inserting the mutated sequence into the gene encoding a PKS subunit
using restriction endonuclease digestion. (See, e.g., Kunkel, T.A. Proc Natl Acad Sci
USA (1985) 82:448; Geisselsoder et al. BioTechniques (1987) 5:786.) Alternatively,
the mutations can be effected using a mismatched primer (generally 10-20 nucleotides
in length) which hybridizes to the native nucleotide sequence (generally cDNA
corresponding to the RNA sequence), at a temperature below the melting temperature



WO 98/49315






PCT/US98/08792 



of the mismatched duplex. The primer can be made specific by keeping primer length
and base composition within relatively narrow limits and by keeping the mutant base
centrally located. Zoller and Smith, Methods Enzymol (1983) 100:468. Primer
extension is effected using DNA polymerase, the product cloned and clones
containing the mutated DNA, derived by segregation of the primer extended strand,
selected. Selection can be accomplished using the mutant primer as a hybridization
probe. The technique is also applicable for generating multiple point mutations. See,
e.g., Dalbie-McFarland et al. Proc Natl Acad Sci USA (1982) 79:6409. PCR
mutagenesis will also find use for effecting the desired mutations.
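
The design rules for a mismatched primer (short length, mutant base centrally located, annealing below the melting temperature) can be sketched in a few lines. This is an illustration under stated assumptions, not a protocol: the function names and the example sequence are hypothetical, and the melting temperature uses the Wallace rule (2 degrees per A/T, 4 degrees per G/C), a rough estimate for short perfectly matched oligonucleotides; a mismatched duplex melts somewhat lower than this estimate.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def mutagenic_primer(template, position, new_base, length=17):
    """Return a window of `template` carrying `new_base` at its centre.

    `position` is the 0-based index of the base to change; `length` must be
    odd so that the mismatch sits exactly in the middle of the primer.
    """
    if length % 2 == 0:
        raise ValueError("length must be odd to centre the mismatch")
    half = length // 2
    start, end = position - half, position + half + 1
    if start < 0 or end > len(template):
        raise ValueError("position too close to the end of the template")
    window = template[start:end]
    return window[:half] + new_base + window[half + 1:]


def reverse_complement(seq):
    """Sequence of the opposite strand, for annealing to the given strand."""
    return seq.translate(COMPLEMENT)[::-1]


def wallace_tm(seq):
    """Wallace-rule melting temperature estimate in degrees Celsius."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    template = "ATGGCTAGCAAGGAGCTGTTCACCGGTGTT"  # hypothetical sequence
    primer = mutagenic_primer(template, position=15, new_base="A")
    print(primer, reverse_complement(primer), wallace_tm(primer))
```

Whether the returned sequence or its reverse complement is used as the primer depends on which strand serves as the single-stranded template.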

Random mutagenesis of selected portions of the nucleotide sequences
encoding enzymatic activities can be accomplished by several different techniques
known in the art, e.g., by inserting an oligonucleotide linker randomly into a plasmid,
by irradiation with X-rays or ultraviolet light, by incorporating incorrect nucleotides
during in vitro DNA synthesis, by error-prone PCR mutagenesis, by preparing
synthetic mutants or by damaging plasmid DNA in vitro with chemicals. Chemical
mutagens include, for example, sodium bisulfite, nitrous acid, nitrosoguanidine,
hydroxylamine, agents which damage or remove bases thereby preventing normal
base-pairing such as hydrazine or formic acid, analogues of nucleotide precursors
such as 5-bromouracil, 2-aminopurine, or acridine intercalating agents such as
proflavine, acriflavine, quinacrine, and the like. Generally, plasmid DNA or DNA
fragments are treated with chemicals, transformed into E. coli and propagated as a
pool or library of mutant plasmids.
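
As a purely computational illustration of what a pool of random mutants looks like, the sketch below applies independent point substitutions to a sequence at a chosen per-base rate. It is a toy model, not a description of any of the chemical or enzymatic methods listed above: the per-base rate, the uniform choice of replacement base, and the names mutate and mutant_pool are all assumptions made for the example, and the input is assumed to contain only the uppercase letters A, C, G and T.

```python
import random

BASES = "ACGT"


def mutate(seq, rate, rng):
    """Substitute each base independently with probability `rate`."""
    out = []
    for base in seq:
        if rng.random() < rate:
            # Pick uniformly among the three other bases.
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)


def mutant_pool(seq, rate, size, seed=0):
    """Return `size` independently mutated copies of `seq`.

    A fixed seed makes the simulated pool reproducible.
    """
    rng = random.Random(seed)
    return [mutate(seq, rate, rng) for _ in range(size)]


if __name__ == "__main__":
    for variant in mutant_pool("ATGGCTAGCAAGGAGCTG", rate=0.05, size=5):
        print(variant)
```

Only substitutions are modelled; insertions, deletions and mutational bias, all of which occur with real mutagens, are left out of this sketch.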

In addition to providing mutated forms of regions encoding enzymatic
activity, regions encoding corresponding activities from different PKS synthases or
from different locations in the same PKS synthase can be recovered, for example,
using PCR techniques with appropriate primers. By "corresponding" activity
encoding regions is meant those regions encoding the same general type of activity -
e.g., a ketoreductase activity in one location of a gene cluster would "correspond" to a
ketoreductase-encoding activity in another location in the gene cluster or in a different
gene cluster; similarly, a complete reductase cycle could be considered
corresponding - e.g., KR/DH/ER would correspond to KR alone.
If replacement of a particular target region in a host polyketide synthase is to
be made, this replacement can be conducted in vitro using suitable restriction
enzymes or can be effected in vivo using recombinant techniques involving
homologous sequences framing the replacement gene in a donor plasmid and a
receptor region in a recipient plasmid. Such systems, advantageously involving
plasmids of differing temperature sensitivities, are described, for example, in PCT
application WO 96/40968.

The vectors used to perform the various operations to replace the enzymatic
activity in the host PKS genes or to support mutations in these regions of the host
PKS genes may be chosen to contain control sequences operably linked to the
resulting coding sequences such that expression of the coding sequences may
be effected in an appropriate host. However, simple cloning vectors may be used as
well.

If the cloning vectors employed to obtain PKS genes encoding derived PKS 
lack control sequences for expression operably linked to the encoding nucleotide 
sequences, the nucleotide sequences are inserted into appropriate expression vectors. 
This need not be done individually, but a pool of isolated encoding nucleotide 
sequences can be inserted into host vectors, the resulting vectors transformed or 
transfected into host cells and the resulting cells plated out into individual colonies. 

Suitable control sequences include those which function in eucaryotic and
procaryotic host cells. Preferred hosts include fungal systems such as yeast and
procaryotic hosts, but single cell cultures of, for example, mammalian cells could also
be used. There is no particular advantage, however, in using such systems.
Particularly preferred are yeast and procaryotic hosts which use control sequences
compatible with Streptomyces spp. Suitable control sequences for single cell
cultures of various types of organisms are well known in the art. Control systems for
expression in yeast, including controls which effect secretion, are widely available and
routinely used. Control elements include promoters, optionally containing operator
sequences, and other elements depending on the nature of the host, such as ribosome
binding sites. Particularly useful promoters for procaryotic hosts include those from
PKS gene clusters which result in the production of polyketides as secondary
metabolites, including those from aromatic (Type II) PKS gene clusters. Examples
are act promoters, tcm promoters, spiramycin promoters, and the like. However,
other bacterial promoters, such as those derived from sugar metabolizing enzymes,
such as galactose, lactose (lac) and maltose, are also useful. Additional examples
include promoters derived from biosynthetic enzymes such as tryptophan (trp), the
β-lactamase (bla), bacteriophage lambda PL, and T5. In addition, synthetic
promoters, such as the tac promoter (U.S. Patent No. 4,551,433), can be used.

Other regulatory sequences may also be desirable which allow for regulation
of expression of the PKS replacement sequences relative to the growth of the host
cell. Regulatory sequences are known to those of skill in the art, and examples
include those which cause the expression of a gene to be turned on or off in response
to a chemical or physical stimulus, including the presence of a regulatory compound.
Other types of regulatory elements may also be present in the vector, for example,
enhancer sequences.

Selectable markers can also be included in the recombinant expression
vectors. A variety of markers are known which are useful in selecting for transformed
cell lines and generally comprise a gene whose expression confers a selectable
phenotype on transformed cells when the cells are grown in an appropriate selective
medium. Such markers include, for example, genes which confer antibiotic resistance
or sensitivity to the plasmid. Alternatively, several polyketides are naturally colored
and this characteristic provides a built-in marker for screening cells successfully
transformed by the present constructs.

The various PKS nucleotide sequences, or a cocktail of such sequences, can be
cloned into one or more recombinant vectors as individual cassettes, with separate
control elements, or under the control of, e.g., a single promoter. The PKS subunits
or cocktail components can include flanking restriction sites to allow for the easy
deletion and insertion of other PKS subunits or cocktail components so that hybrid
PKSs can be generated. The design of such unique restriction sites is known to those
of skill in the art and can be accomplished using the techniques described above, such
as site-directed mutagenesis and PCR.

As described above, particularly useful control sequences are those which
themselves, or using suitable regulatory systems, activate expression during transition
from growth to stationary phase in the vegetative mycelium. The system contained in
the illustrated plasmid pCK7, i.e., the actI/actIII promoter pair and the actII-ORF4, an
activator gene, is particularly preferred. Particularly preferred hosts are those which
lack their own means for producing polyketides so that a cleaner result is obtained.
Illustrative host cells of this type include the modified S. coelicolor CH999 culture
described in PCT application WO 96/40968 and similar strains of S. lividans.

The expression vectors containing nucleotide sequences encoding a variety of
PKS systems for the production of different polyketides are then transformed into the
appropriate host cells to construct the library. In one straightforward approach, a
mixture of such vectors is transformed into the selected host cells and the resulting
cells plated into individual colonies and selected for successful transformants. Each
individual colony will then represent a colony with the ability to produce a particular
PKS synthase and ultimately a particular polyketide. Typically, there will be
duplications in some of the colonies; the subset of the transformed colonies that
contains a different PKS in each member colony can be considered the library.
Alternatively, the expression vectors can be used individually to transform hosts,
which transformed hosts are then assembled into a library. A variety of strategies
might be devised to obtain a multiplicity of colonies each containing a PKS gene
cluster derived from the naturally occurring host gene cluster so that each colony in
the library produces a different PKS and ultimately a different polyketide. The
number of different polyketides that are produced by the library is typically at least
four, more typically at least ten, and preferably at least 20, more preferably at least 50,
reflecting similar numbers of different altered PKS gene clusters and PKS gene
products. The number of members in the library is arbitrarily chosen; however, the
degrees of freedom outlined above with respect to the variation of starter, extender
units, stereochemistry, oxidation state, and chain length are quite large.
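
To give a sense of how quickly these degrees of freedom multiply, the following is a minimal sketch that counts module-level variants when each of six modules can independently take one acyltransferase option and one reductive-cycle option. The option lists and the names AT_OPTIONS, REDUCTIVE_OPTIONS, module_choices, count_variants and enumerate_variants are illustrative assumptions and are not taken from the specification; starter unit, stereochemistry and chain length are left out for brevity.

```python
from itertools import product

# Hypothetical per-module choices (assumptions for illustration only).
AT_OPTIONS = ("methylmalonyl-specific AT", "malonyl-specific AT")
REDUCTIVE_OPTIONS = ("none", "KR", "DH/KR", "DH/ER/KR")
N_MODULES = 6  # the scaffold PKS discussed here has six modules


def module_choices():
    """All (AT, reductive cycle) pairings available to a single module."""
    return list(product(AT_OPTIONS, REDUCTIVE_OPTIONS))


def count_variants(n_modules=N_MODULES):
    """Number of distinct assemblies when modules vary independently."""
    return len(module_choices()) ** n_modules


def enumerate_variants(n_modules):
    """Iterate over every assembly as a tuple of per-module choices."""
    return product(module_choices(), repeat=n_modules)


if __name__ == "__main__":
    print(len(module_choices()))  # options available to one module
    print(count_variants())       # assemblies across all six modules
```

The count is an upper bound on genotypes only; it says nothing about which altered gene clusters actually yield a detectable polyketide.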

Methods for introducing the recombinant vectors of the present invention into
suitable hosts are known to those of skill in the art and typically include the use of
CaCl2 or other agents, such as divalent cations, lipofection, DMSO, protoplast
transformation and electroporation.

As disclosed in copending application Serial No. 08/989,332 filed
11 December 1997, incorporated herein by reference, a wide variety of hosts can be
used, even though some hosts natively do not contain the appropriate
post-translational mechanisms to activate the acyl carrier proteins of the synthases. These
hosts can be modified with the appropriate recombinant enzymes to effect these
modifications.

The polyketide producing colonies can be identified and isolated using known
techniques and the produced polyketides further characterized. The polyketides
produced by these colonies can be used collectively in a panel to represent a library or
may be assessed individually for activity.

The libraries can thus be considered at four levels: (1) a multiplicity of
colonies each with a different PKS encoding sequence encoding a different PKS
cluster but all derived from a naturally occurring PKS cluster; (2) colonies which
contain the proteins that are members of the PKS produced by the coding sequences;
(3) the polyketides produced; and (4) antibiotics derived from the polyketides. Of
course, combination libraries can also be constructed wherein members of a library
derived, for example, from the erythromycin PKS can be considered as a part of the
same library as those derived from, for example, the rapamycin PKS cluster.

Colonies in the library are induced to produce the relevant synthases and thus
to produce the relevant polyketides to obtain a library of candidate polyketides. The
polyketides secreted into the media can be screened for binding to desired targets,
such as receptors, signaling proteins, and the like. The supernatants per se can be
used for screening, or partial or complete purification of the polyketides can first be
effected. Typically, such screening methods involve detecting the binding of each
member of the library to receptor or other target ligand. Binding can be detected
either directly or through a competition assay. Means to screen such libraries for
binding are well known in the art.

Alternatively, individual polyketide members of the library can be tested
against a desired target. In this event, screens wherein the biological response of the
target is measured can more readily be included.

Indeed, a large number of novel polyketides have been prepared according to
the method of the invention as illustrated in the examples below. These novel
polyketides are useful intermediates in formation of compounds with antibiotic
activity through glycosylation reactions as described above. As indicated above, the
individual polyketides are reacted with suitable sugar derivatives to obtain compounds
of antibiotic activity. Antibiotic activity can be verified using typical screening
assays such as those set forth in Lehrer, R. et al. J Immunol Meth (1991) 137:167-173.

New polyketides which are the subject of the invention are those described 
below. New antibiotics which are the subject of the invention include the 
glycosylated forms of these polyketides. 

In one embodiment, the polyketides of the invention include the compounds of 
structure (1) and the glycosylated forms thereof. The compounds include the
polyketide structure: 



including the isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated 
substituted or unsubstituted hydrocarbyl of 1-15C; 

each of R' and R^ is independently H or alkyl (1-4C) wherein any alkyl at R^ 
may optionally be substituted; 
is H2, HOH or =O;

with the provisos that: 

at least one of R^ and R^ must be alkyl (1-4C); and 
the compound is other than compounds 1, 2, 3, 5 and 6 of Figure 6A.

Particularly preferred embodiments of formula (1) include compound 4 shown
in Figure 6A.

In another embodiment, the polyketides of the invention include the
compounds of formula (2) and the glycosylated forms thereof. These compounds
include the polyketide structure:

(2)



including the isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated 
substituted or unsubstituted hydrocarbyl of 1-15C; 

each of R^ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R' may optionally be substituted;

each of and is independently H2, HOH or =O;

with the proviso that:

at least two of R\ R^ and R^ are alkyl (1-4C).

In another embodiment, the polyketides of the invention include the
compounds of structure (3) and the glycosylated forms thereof. The compounds
include the polyketide structure:




including the isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R\ R^ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R^ may optionally be substituted;

each of X' and X^ is independently H2, HOH or =O;

with the provisos that:

at least one of R' and R^ must be alkyl (1-4C); and
the compound is other than compound 8 of Figure 6A.

The antibiotic forms of the polyketide of formula (3) are the corresponding
glycosylated forms.

Still other embodiments are those of the following formula, including the
glycosylated forms thereof. These are derived from the compound of formula (4)
which has the structure:



including the isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R', R^ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R' may optionally be substituted;

each of X* and is independently H2, HOH or =O;

with the proviso that:

at least one of R^ and R^ is alkyl (1-4C); and
the compound is other than compound 9 of Figure 6A.

Still other embodiments are the result of the condensation of five modules of
the polyketide synthase system. The polyketide forms of these compounds are of the
formula:

including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R\ R^, R^, R"* and R^ is independently H or alkyl (1-4C) wherein any
alkyl at R' may optionally be substituted;

each of X\ X^ and X* is independently H2, HOH or =O; or

X^ or X^ or X^ or X^ is H and the compound of formula (5) contains a π-bond
at positions 8-9 or 6-7 or 4-5 or 2-3;

with the proviso that:

at least two of R^-R^ are alkyl (1-4C); and

the compound is other than compound 13 or 14 of Figure 6A or compound
205, 210-213 of Figure 11.

Preferred forms of compounds that include formula (5) are those wherein at
least three, more preferably at least four, of R^-R^ are alkyl (1-4C), preferably methyl
or ethyl.

Also preferred are compounds wherein X^ is -OH and/or X^ is =O, and/or X^ is



The glycosylated forms of these compounds are also useful antibiotics.

Resulting from the condensation effected by six modules are the compounds
which comprise the formula:



H.

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R'-R^ is independently H or alkyl (1-4C) wherein any alkyl at R^ may
optionally be substituted;

each of X^-X^ is independently H2, HOH or =O; or

each of X^-X^ is independently H and the compound of formula (6) contains a
π-bond in the ring adjacent to the position of said X at 2-3, 4-5, 6-7, 8-9 and/or 10-11;

with the proviso that:

at least two of R*-R^ are alkyl (1-4C); and
the compound is other than compound 17, 24 or 28 of Figure 6B, compound
301-311 of Figure 12(A) or compounds 312-322 of Figure 12(B).

Preferred compounds comprising formula 6 are those wherein at least three of
R^-R^ are alkyl (1-4C), preferably methyl or ethyl; more preferably wherein at least
four of R^-R^ are alkyl (1-4C), preferably methyl or ethyl.

Also preferred are those wherein is H2, =O or * * 'OH, and/or X^ is H,
and/or X^ is OH and/or X^ is OH and/or X^ is OH.

Particularly preferred are compounds of formulas 18-23, 25-27, 29-75 and 101
and 113 of Figures 6B-6F. Also preferred are compounds with variable R* when
R*-R^ are methyl, X^ is =O, and X\ X^ and X^ are OH, examples of which are depicted in
formulas 96-100 and 104-107 of Figures 6G and 6H. The glycosylated forms of the
foregoing are also preferred.

Other polyketides which result from the condensation catalyzed by six
modules of a modular PKS include those of the formula:




including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R^-R^ is independently H or alkyl (1-4C) wherein any alkyl at R^ may
optionally be substituted;

R*^ is alkyl (1-5C);

each of and and X*^ is independently H2, HOH or =O;

with the proviso that:

at least two of R^-R"* are alkyl (1-4C).

These and their corresponding glycosylated forms are also included in the
invention.

Still others include those of the formula: 




including the isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated 
substituted or unsubstituted hydrocarbyl of 1-15C; 

each of R^-R^ is independently H or alkyl (1-4C) wherein any alkyl at R* may 
optionally be substituted; 

R is alkyl (1-5C);

X is OH or H;

each X* , X\ X^ and X^ is independently H2, HOH or =O; or X^ or X^ is H and
the compound of formula (8) has a π-bond between positions 7-8 or 9-10, with
the proviso that:

if X^ is H, at least one of X^ and X* is HOH or =O.

These and their corresponding glycosylated forms are also included in the 
invention. 

As above, the glycosylated forms are useful antibiotics.

As set forth above, R in the compounds of the invention may be substituted
as well as unsubstituted. Suitable substituents include halo (F, Cl, Br, I), N3, OH,
O-alkyl (1-6C), S-alkyl (1-6C), CN, O-acyl (1-7C), O-aryl (6-10C), O-alkyl-aryl
(7-14C), NH2, NH-alkyl (1-6C) and N-(alkyl)2.

Suitable substituents on are selected from the same group as those for R*.
In addition, the substituents on R^ and R* may form a ring system such as an epoxide
ring, or a larger heterocyclic ring including O, or N or S. Preferred substituents for R*
and R' are halo, OH and NH2. Unsubstituted forms are also preferred.

Particularly useful as antibiotics within the scope of the invention are 
compounds of formulas 82-93 as set forth in Figure 8 herein. 

Still another embodiment of the compounds of the invention is set forth as
compound 94 in Figure 9. Its glycosylated form, shown as compound 95, is useful as
an antibiotic.

Examples 

The following examples are intended to illustrate, but not to limit the 
invention. 

Materials and Methods
General Techniques:
Bacterial strains, plasmids, and culture conditions. S. coelicolor CH999
described in WO 95/08548, published 30 March 1995, was used as an expression host.
DNA manipulations were performed in Escherichia coli MC1061. Plasmids were
passaged through E. coli ET12567 (dam dcm hsdS CmR) (MacNeil, D.J. J Bacteriol
(1988) 170:5607) to generate unmethylated DNA prior to transformation of
S. coelicolor. E. coli strains were grown under standard conditions. S. coelicolor
strains were grown on R2YE agar plates (Hopwood, D.A. et al. Genetic manipulation
of Streptomyces. A laboratory manual. The John Innes Foundation: Norwich, 1985).
pRM5, also described in WO 95/08548, includes a ColE1 replicon, an appropriately
truncated SCP2* Streptomyces replicon, two act-promoters to allow for bidirectional
cloning, the gene encoding the actII-ORF4 activator which induces transcription from
act promoters during the transition from growth phase to stationary phase, and
appropriate marker genes. Engineered restriction sites facilitate the combinatorial
construction of PKS gene clusters starting from cassettes encoding individual domains
of naturally occurring PKSs.

When pRM5 is used for expression of PKS, (i) all relevant biosynthetic genes
are plasmid-borne and therefore amenable to facile manipulation and mutagenesis in
E. coli, (ii) the entire library of PKS gene clusters can be expressed in the same
bacterial host which is genetically and physiologically well-characterized and
presumably contains most, if not all, ancillary activities required for in vivo
production of polyketides, (iii) polyketides are produced in a secondary
metabolite-like manner, thereby alleviating the toxic effects of synthesizing potentially bioactive
compounds in vivo, and (iv) molecules thus produced undergo fewer side reactions
than if the same pathways were expressed in wild-type organisms or blocked mutants.

Manipulation of DNA and organisms. Polymerase chain reaction (PCR)
was performed using Taq polymerase (Perkin Elmer Cetus) under conditions
recommended by the enzyme manufacturer. Standard in vitro techniques were used
for DNA manipulations (Sambrook, et al. Molecular Cloning: A Laboratory Manual
(Current Edition)). E. coli was transformed with a Bio-Rad E. Coli Pulsing apparatus
using protocols provided by Bio-Rad. S. coelicolor was transformed by standard
procedures (Hopwood, D.A. et al. Genetic manipulation of Streptomyces. A
laboratory manual. The John Innes Foundation: Norwich, 1985) and transformants
were selected using 2 mL of a 500 μg/ml thiostrepton overlay.

Preparation A 

Construction of the Complete Erythromycin PKS Gene Cluster
Recovery of the Erythromycin PKS Genes

Although various portions of the erythromycin PKS gene cluster can be
manipulated separately at any stage of the process of preparing libraries, it may be
desirable to have a convenient source of the entire gene cluster in one place. Thus,
the entire erythromycin PKS gene cluster can be recovered on a single plasmid if
desired. This is illustrated below utilizing derivatives of the plasmid pMAK705
(Hamilton et al. J Bacteriol (1989) 171:4617) to permit in vivo recombination
between a temperature-sensitive donor plasmid, which is capable of replication at a
first, permissive temperature and incapable of replication at a second, non-permissive
temperature, and recipient plasmid. The eryA genes thus cloned gave pCK7, a
derivative of pRM5 (McDaniel et al. Science (1993) 262:1546). A control plasmid,
pCK7f, was constructed to carry a frameshift mutation in eryAI. pCK7 and pCK7f
possess a ColE1 replicon for genetic manipulation in E. coli as well as a truncated
SCP2* (low copy number) Streptomyces replicon.

These plasmids also contain the divergent actI/actIII promoter pair and
actII-ORF4, an activator gene, which is required for transcription from these promoters and
activates expression during the transition from growth to stationary phase in the
vegetative mycelium. High-level expression of PKS genes occurs at the onset of the
stationary phase of mycelial growth. The recombinant strains therefore produce the
encoded polyketides as secondary metabolites.

In more detail, pCK7 (Figure 4), a shuttle plasmid containing the complete
eryA genes, which were originally cloned from pS1 (Tuan et al. Gene (1990) 90:21),
was constructed as follows. The modular DEBS PKS genes were transferred
incrementally from a temperature-sensitive "donor" plasmid, i.e., a plasmid capable of
replication at a first, permissive temperature and incapable of replication at a second,
non-permissive temperature, to a "recipient" shuttle vector via a double recombination
event, as depicted in Figure 5. A 25.6 kb SphI fragment from pS1 was inserted into
the SphI site of pMAK705 (Hamilton et al. J Bacteriol (1989) 171:4617) to give
pCK6 (CmR), a donor plasmid containing eryAII, eryAIII and the 3' end of eryAI.
Replication of this temperature-sensitive pSC101 derivative occurs at 30°C but is
arrested at 44°C. The recipient plasmid, pCK5 (ApR, TcR), includes a 12.2 kb eryA
fragment from the eryAI start codon (Caffrey et al. FEBS Lett (1992) 304:225) to the
XcmI site near the beginning of eryAII, a 1.4 kb EcoRI-BsmI pBR322 fragment
encoding the tetracycline resistance gene (Tc), and a 4.0 kb NotI-EcoRI fragment from
the end of eryAIII. PacI, NdeI, and ribosome binding sites were engineered at the
eryAI start codon in pCK5. pCK5 is a derivative of pRM5 (described above). The 5'
and 3' regions of homology are 4.1 kb and 4.0 kb, respectively. MC1061 E. coli was
transformed with pCK5 and pCK6 and subjected to carbenicillin and chloramphenicol
selection at 30°C. Colonies harboring both plasmids (ApR, CmR) were then restreaked
at 44°C on carbenicillin and chloramphenicol plates. Only cointegrates formed by a
single recombination event between the two plasmids were viable. Surviving colonies
were propagated at 30°C under carbenicillin selection, forcing the resolution of the
cointegrates via a second recombination event. To enrich for pCK7 recombinants,
colonies were restreaked again on carbenicillin plates at 44°C. Approximately 20%
of the resulting colonies displayed the desired phenotype (ApR, TcS, CmS). The final
pCK7 candidates were thoroughly checked via restriction mapping. A control
plasmid, pCK7f, which contains a frameshift error in eryAI, was constructed in a
similar manner. pCK7 and pCK7f were transformed into E. coli ET12567 (MacNeil J
Bacteriol (1988) 170:5607) to generate unmethylated plasmid DNA and subsequently
moved into Streptomyces coelicolor CH999.
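
The logic of the temperature and antibiotic selection steps can be sketched as a small model. This is a simplified illustration under stated assumptions, not a description of the actual procedure: the rule that a marker protects the cell only while its plasmid can replicate, the 44°C cutoff, the marker sets assigned to each construct, and the names Plasmid and grows are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    markers: frozenset           # resistance markers carried, e.g. {"Ap"}
    temperature_sensitive: bool  # True: replicates at 30 C but not at 44 C


def grows(plasmids, temp_c, selection):
    """Return True if a cell carrying `plasmids` survives on the plate.

    Simplified rule (assumption): a marker protects the cell only if the
    plasmid carrying it can still replicate at the plating temperature.
    """
    maintained = [p for p in plasmids
                  if not (p.temperature_sensitive and temp_c >= 44)]
    resistances = set().union(*(p.markers for p in maintained))
    return set(selection) <= resistances


# Marker assignments below follow one reading of the text (assumption).
pCK5 = Plasmid("pCK5 (recipient)", frozenset({"Ap", "Tc"}), False)
pCK6 = Plasmid("pCK6 (donor)", frozenset({"Cm"}), True)
cointegrate = Plasmid("pCK5::pCK6", frozenset({"Ap", "Tc", "Cm"}), False)
pCK7 = Plasmid("pCK7", frozenset({"Ap"}), False)

if __name__ == "__main__":
    # Carbenicillin selects for the Ap marker, chloramphenicol for Cm.
    print(grows([pCK5, pCK6], 30, {"Ap", "Cm"}))   # both plasmids, 30 C
    print(grows([pCK5, pCK6], 44, {"Ap", "Cm"}))   # donor cannot replicate
    print(grows([cointegrate], 44, {"Ap", "Cm"}))  # single recombination
    print(grows([pCK7], 44, {"Ap"}))               # resolved recombinant
```

In this model only the cointegrate passes the 44°C double selection, and the resolved plasmid grows on carbenicillin alone, which mirrors the order of the plating steps.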

Upon growth of CH999/pCK7 on R2YE medium, the organism produced
abundant quantities of two polyketides. The addition of propionate (300 mg/L) to the
growth medium resulted in approximately a two-fold increase in yield of polyketide
product. Proton and 13C NMR spectroscopy, in conjunction with propionic-1-13C acid
feeding experiments, confirmed the major product as 6dEB (>40 mg/L) (Figure 1A).
The minor product was identified as 8,8a-deoxyoleandolide (>10 mg/L) (Figure 1A),
which apparently originates from an acetate starter unit instead of propionate in the
6dEB biosynthetic pathway. 13C2 sodium acetate feeding experiments confirmed the
incorporation of acetate into the minor product. Three high molecular weight proteins
(>200 kDa), presumably DEBS1, DEBS2, and DEBS3 (Caffrey et al. FEBS Lett
(1992) 304:225), were also observed in crude extracts of CH999/pCK7 via
SDS-polyacrylamide gel electrophoresis. No polyketide products were observed from
CH999/pCK7f. The inventors hereby acknowledge support provided by the
American Cancer Society (IRG-32-34).

Example 1 

Preparation of Scaffolds for Replacing DEBS AT and KR Domains
For each of the six modules of DEBS, a subclone was made containing
endonuclease restriction sites engineered at selected boundaries of the acyltransferase
(AT) and reduction (KR or DH/ER/KR) domains. The restriction sites were
introduced into the subclones by PCR mutagenesis. A BamHI site was used for the 5'
boundary of AT domains, a PstI site was introduced between the AT and reductive
domains, and XbaI was used at the 3' end of the reductive domain (see Figure 5). This
resulted in the following engineered sequences (lowercase indicates engineered
restriction site):

Module 1 (pKOS011-16)
    5' AT boundary           GCGCAGCAGggatccGTCTTCGTC
    AT/KR boundary           CGCGTCTGGctgcagCCGAAGCCG
    3' KR boundary           CCGGCCGAAtctagaGTGGGCGCG

Module 2 (pKOS001-11)
    5' AT boundary           TCCGACGGTggatccGTGTTCGTC
    AT/KR boundary           CGGTTCTGGctgcagCCGGACCGC
    3' KR boundary           ACGGAGAGCtctagaGACCGGCTG

Module 3 (pKOS024-2)
    5' AT boundary           GACGGGCGCggatccGTCTTCCTG
    AT/KR boundary           CGCTACTGGctgcagCCCGCCGCA
    3' KR boundary           ACCGGCGAGtctagaCAACGGCTC

Module 4 (pKOS024-3)
    5' AT boundary           GCGCCGCGCggatccGTCCTGGTC
    AT(DH/ER/KR) boundary    CGCTTCTGGctgcagCGCACCGG
    3' DH/ER/KR boundary     GGGCCGAAtctagaGACCGGCTC

Module 5 (pKOS006-182)
    5' AT boundary           ACTCGCCGCggatccGCGATGGTG
    AT/KR boundary           CGGTACTGGctgcagATCCCCACC
    3' KR boundary           GAGGAGGGCtctagaCTCGCCCAG

Module 6 (pKOS015-52)
    5' AT boundary           TCCGCCGGCggatccGTTTTCGTC
    AT/KR boundary           CGGTACTGGctgcagCCGGAGGTG
    3' KR boundary           GTGGGGGCtctagaGCGGTGCAG
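
The lowercase convention makes the engineered sites easy to locate programmatically. The following is a minimal sketch that finds the lowercase run in a boundary sequence and names the enzyme from its recognition sequence (GGATCC for BamHI, CTGCAG for PstI, TCTAGA for XbaI). The Module 1 sequences are written with the sites as read from the listing above; the names SITES, MODULE_1 and engineered_site are assumptions introduced for illustration.

```python
import re

# Recognition sequences of the three enzymes named in this example.
SITES = {"ggatcc": "BamHI", "ctgcag": "PstI", "tctaga": "XbaI"}

# Module 1 boundaries, with lowercase marking the engineered site.
MODULE_1 = {
    "5' AT boundary": "GCGCAGCAGggatccGTCTTCGTC",
    "AT/KR boundary": "CGCGTCTGGctgcagCCGAAGCCG",
    "3' KR boundary": "CCGGCCGAAtctagaGTGGGCGCG",
}


def engineered_site(seq):
    """Return (offset, site, enzyme) for the first lowercase run in seq."""
    match = re.search(r"[a-z]+", seq)
    if match is None:
        raise ValueError("no lowercase (engineered) bases found")
    site = match.group(0)
    return match.start(), site, SITES.get(site, "unknown")


if __name__ == "__main__":
    for boundary, seq in MODULE_1.items():
        offset, site, enzyme = engineered_site(seq)
        print(f"{boundary}: {enzyme} ({site}) at offset {offset}")
```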
Example 2

Preparation of Cassettes from the Rapamycin PKS
A cosmid library of genomic DNA from Streptomyces hygroscopicus ATCC
29253 was used to prepare DNA cassettes from the rapamycin PKS gene
cluster to be used as replacements into the enzymatic activity regions of the
erythromycin gene cluster. Cassettes were prepared by PCR amplification from
appropriate cosmids or subclones using the primer pairs listed in Table 1, and were
designed to introduce suitable restriction sites at the ends of the cassettes. The
rapAT2 cassette is flanked by BglII and PstI sites, and the rapAT14 cassette is flanked
by BamHI and PstI sites. The reductive cycle cassettes are flanked by PstI and XbaI
sites. Large DH/ER/KR cassettes were amplified in two pieces, then joined at an
engineered site in order to minimize errors introduced during PCR amplification
of long DNA sequences. The rapKR4 cassette was made by cloning a 1.3 kb
NheI/XbaI fragment from the rapDH/KR4 cassette above into the XbaI site in pUC19.
There is a PstI site that is in-frame and upstream of XbaI in pUC19 that generates the
following junction at the 5'-end of the cassette:
5'-ctgcagGTCGAC TCTAGC CTGGT...



Table 1

Primer pairs used for PCR amplification of rapamycin PKS cassettes.
All primers are listed from 5' to 3'. Engineered restriction sites are lower case.

Module                      Primer    Sequence
rapAT2                      Forward:  TTTagatctGTGTTCGTCTTCCCGGGT
                            Reverse:  TTTctgcagCCAGTACCGCTGGTGCTGGAAGGCGTA
rapAT14                     Forward:  TTTggatccGCCTTCCTGTTCGACGGGCAAGGC
                            Reverse:  TTTctgcagCCAGTAGGACTGGTGCTGGAACGG
rapKR2                      Forward:  TTTctgcagGAGGGCACGGACCGGGCGACTGCGGGT
                            Reverse:  TTTtctagaACCGGCGGCAGCGGCCCGCCGAGCAAT
rapDH/KR4                   Forward:  TTctgcagAGCGTGGACCGGGCGGCT
                            Reverse:  TTTtctagaGTCACCGGTAGAGGCGGCCCT
rapDH/ER/KR1 (left half)    Forward:  TTTctgcagGGCGTGGACCGGGCGGCTGCC
                            Reverse:  TTTctcgagCACCACGCCCGCAGCCTCACC
rapDH/ER/KR1 (right half)   Forward:  TTTctcgagGTCGGTCCGGAGGTCCAGGAT
                            Reverse:  TTTtctagaATCACCGGTAGAAGCAGCCCG
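
For a cassette amplified in two pieces, the inner primers must carry the same engineered site so that the halves can be joined. The following is a minimal sketch of that check for the rapDH/ER/KR1 halves, with the lowercase site in both inner primers read as ctcgag (the recognition sequence of XhoI, an enzyme the text does not name). The function names and the palindrome test are illustrative assumptions, not part of the described method.

```python
# Translation table for complementing DNA while preserving letter case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

# Inner primers of the two-piece rapDH/ER/KR1 cassette (5' to 3').
LEFT_HALF_REVERSE = "TTTctcgagCACCACGCCCGCAGCCTCACC"
RIGHT_HALF_FORWARD = "TTTctcgagGTCGGTCCGGAGGTCCAGGAT"


def reverse_complement(seq):
    """Reverse complement of a DNA string, preserving letter case."""
    return seq.translate(COMPLEMENT)[::-1]


def lowercase_site(primer):
    """Collect the lowercase (engineered) bases of a primer."""
    return "".join(base for base in primer if base.islower())


def is_palindromic(site):
    """True if the site reads the same on both strands."""
    return site.upper() == reverse_complement(site).upper()


def can_join(left_reverse, right_forward):
    """True if both primers carry the same palindromic engineered site."""
    left_site = lowercase_site(left_reverse)
    right_site = lowercase_site(right_forward)
    return left_site == right_site and is_palindromic(left_site)


if __name__ == "__main__":
    print(lowercase_site(LEFT_HALF_REVERSE))
    print(can_join(LEFT_HALF_REVERSE, RIGHT_HALF_FORWARD))
```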
Example 3

Replacement of DEBS Modules By Rapamycin PKS Cassettes 
The following are typical procedures. The products are indicated by their 
numbers in Figure 6, where "a" represents the embodiment where R is methyl; **b" 
5 represents the embodiment where R is hydrogen. 

a) Replacement of DEBS DH/ER/KR4. A portion of the erythromycin
gene of module 4 (eryDH/ER/KR4) was replaced either with the corresponding
rapamycin activities of the first rapamycin module (rapDH/ER/KR1) or of module 4
of rapamycin (rapDH/KR4). The replacement utilized the technique of Kao et al.
Science (1994) 265:509-512. A donor plasmid was prepared by first amplifying 1 kbp
regions flanking the DH/ER/KR4 of DEBS to contain a PstI site at the 3' end of the
left flank and an XbaI site at the 5' end of the right flank. The fragments were ligated
into a temperature-sensitive donor plasmid, in a manner analogous to that set forth for
KR6 in paragraph b) of this example, and the rapamycin cassettes prepared as
described in Example 2 were inserted into the PstI/XbaI sites. The recipient plasmid
was pCK7 described in Preparation A. The in vivo recombination technique resulted
in the expression plasmids pKOS011-19 (eryDH/ER/KR4 -> rapDH/ER/KR1) and
pKOS011-21 (eryDH/ER/KR4 -> rapDH/KR4). The junctions at which the PstI and
XbaI sites were introduced into DEBS in both vectors are as follows:

GAGCCCCAGCGGTACTGGCTGCAG rap cassette
TCTAGAGCGGTGCAGGCGGCCCCG

The resulting expression vectors were transformed into S. coelicolor CH999
and successful transformants grown as described above. The transformant containing
the rapDH/ER/KR1 cassette produced the polyketide shown in Figure 6 as 23a,b; the
transformant containing the plasmid with rapDH/KR4 cassette produced the
polyketide shown in Figure 6 as 24a,b. As shown, these polyketides differ from
6-deoxyerythronolide B by virtue of a 6,7 alkene in the case of 24a and by the
C6-methyl stereochemistry in the case of 23a.

b) Replacement of DEBS KR6. In a manner analogous to that set forth in
paragraph a), plasmid pKOS011-25, wherein eryKR6 was replaced by rapDH/KR4,
was prepared by substituting regions flanking the KR6 domain of DEBS in
construction of the donor plasmid.

Approximately 1 kb regions flanking the eryKR6 domain were PCR amplified 
with the following primers: 



left flank     forward   5'-TTTGGATCCGTTTTCGTCTTCCCAGGTCAG
               reverse   5'-TTTCTGCAGCCAGTACCGCTGGGGCTCGAA

right flank    forward   5'-TTTTCTAGAGCGGTGCAGGCGGCCCCGGCG
               reverse   5'-AAAATGCATCTATGAATTCCCTCCGCCCA



These fragments were then cloned into a pMAK705 derivative in which the
multiple cloning site region was modified to accommodate the restriction sites of the
fragments (i.e., BamHI/PstI for the left flank and XbaI/NsiI for the right flank).
Cassettes were then inserted into the PstI/XbaI sites of the above plasmid to generate
donor plasmids for the in vivo recombination protocol. The resulting PstI and XbaI
junctions engineered into DEBS are as follows:

GAACACCAGCGCTTCTGGCTGCAG rap cassette
TCTAGAGACCGGCTCGCCGGTCGG

Regions flanking the KR6 domain of DEBS were used to construct the donor 
plasmids. 
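
As a quick consistency check on the four flank primers listed above, the restriction sites named for each flank can be located in the primer sequences. The following is only an illustrative sketch: the helper name `sites_in` is hypothetical, and the recognition sequences used (BamHI GGATCC, PstI CTGCAG, XbaI TCTAGA, NsiI ATGCAT) are the standard ones for these enzymes rather than values stated in this example.

```python
# Standard recognition sequences (assumed here; not spelled out in the text).
SITES = {
    "BamHI": "GGATCC",
    "PstI": "CTGCAG",
    "XbaI": "TCTAGA",
    "NsiI": "ATGCAT",
}

# Primer sequences as listed for the eryKR6 flanking regions.
PRIMERS = {
    "left flank forward": "TTTGGATCCGTTTTCGTCTTCCCAGGTCAG",
    "left flank reverse": "TTTCTGCAGCCAGTACCGCTGGGGCTCGAA",
    "right flank forward": "TTTTCTAGAGCGGTGCAGGCGGCCCCGGCG",
    "right flank reverse": "AAAATGCATCTATGAATTCCCTCCGCCCA",
}


def sites_in(seq):
    """Return {enzyme: 0-based position} for every listed site found in seq."""
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


for label, seq in PRIMERS.items():
    print(label, sites_in(seq))
```

Each primer carries exactly one of the four sites immediately after a three-base 5' tail, which matches the BamHI/PstI (left flank) and XbaI/NsiI (right flank) cloning scheme described above.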

Transformants of S. coelicolor CH999 resulted in the production of the 
polyketide shown in Figure 6 as 74a,b. 

c) Replacement of DEBS KR2. The eryKR2 enzymatic activity was
replaced in a series of vectors using in vitro insertion into the PstI/XbaI sites of
pKAO263. pKAO263 is a derivative of pCK13 described in Kao, C.M. J Am Chem
Soc (1996) 118:9184-9185. It was prepared by introducing the PstI and XbaI
restriction sites positioned identically to those in the analogous 2-module DEBS
system described by Bedford, D. et al. Chem and Biol (1996) 3:827-831. Three
expression plasmids were prepared: pKOS009-7 (eryKR2 -> rapDH/KR4);
pKAO392 (eryKR2 -> rapKR2); and pKAO410 (eryKR2 -> rapDH/ER/KR1). These
plasmids, when transformed into S. coelicolor CH999, resulted in the production of
polyketides with the structures 12a,b; 3a,b; and 10a,b and 11a,b in Figure 6, respectively.






An additional vector, pKAO400 (eryKR2 -> rapKR4), produced the same results as
pKAO392.

d) Replacement of DEBS AT2. The DEBS AT activity from module 2
was excised after inserting restriction sites BamHI and PstI flanking the AT module 2
domain into pCK12 (Kao et al. J Am Chem Soc (1995) 117:9105-9106). After
digestion with BamHI/PstI, the BglII/PstI fragment containing rapAT2 was inserted.
The resulting engineered DEBS/rapAT2 junction is as follows (BamHI/BglII ligation =
GGATCT; PstI = CTGCAG):

AGTGCCTCCGACGGTGGATCT rapAT2 CTGCAGCCGGACCGCACCACCCCT

S. coelicolor CH999 transformed with the resulting plasmid, pKOS008-51,
produced the polyketides 6a,b shown in Figure 6.
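
The hybrid junction quoted above follows from the fact that BamHI and BglII leave the same GATC overhang. The sketch below illustrates this on the top strand only; the function name `ligate_hybrid` is hypothetical, and the cut positions (G^GATCC for BamHI, A^GATCT for BglII) are the standard ones for these enzymes, assumed here rather than taken from the text.

```python
BAMHI = "GGATCC"  # cut as G^GATCC (assumed standard cut position)
BGLII = "AGATCT"  # cut as A^GATCT (assumed standard cut position)
CUT_OFFSET = 1    # both enzymes cut after the first base of the site


def ligate_hybrid(upstream_site, downstream_site, offset=CUT_OFFSET):
    """Join the upstream half of one cut site to the downstream half of another."""
    return upstream_site[:offset] + downstream_site[offset:]


# BamHI-cut vector end joined to the BglII-cut end of the rapAT2 fragment.
hybrid = ligate_hybrid(BAMHI, BGLII)

# Junction sequences as given in the text.
LEFT_JUNCTION = "AGTGCCTCCGACGGTGGATCT"
RIGHT_JUNCTION = "CTGCAGCCGGACCGCACCACCCCT"

print(hybrid)                              # GGATCT
print(LEFT_JUNCTION.endswith(hybrid))      # True
print(hybrid in (BAMHI, BGLII))            # False: neither original site is restored
print(RIGHT_JUNCTION.startswith("CTGCAG")) # True: the PstI site is regenerated
```

Because the hybrid GGATCT is neither a BamHI nor a BglII site, the left junction cannot be recut by either enzyme, whereas the PstI site on the right is preserved.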



Example 4 

Excision of DEBS Reductive Cycle Domains
The following is a typical procedure. The products are indicated by their
numbers in Figure 6, where "a" represents the embodiment where R is methyl; "b"
represents the embodiment where R is hydrogen.

A duplex oligonucleotide linker (ARdx) was designed to allow complete
excision of reductive cycle domains. Two synthetic oligonucleotides:

    5'-GCCGGACCGCACCACCCCTCGTGACGGAGAACCGGAGACGGAGAGCT-3'
3'-ACGTCGGCCTGGCGTGGTGGGGAGCACTGCCTCTTGGCCTCTGCCTCTCGAGATC-5'
   PstI                                                XbaI

were designed to generate PstI- and XbaI-compatible ends upon hybridization. This
duplex linker was ligated into the PstI and XbaI sites of the recombination donor
plasmid containing the appropriate left- and right-flanking regions of the reductive
domain to be excised. The in vivo recombination technique of Example 3, paragraph
a) was then used. The donor plasmid contained the duplex linker ARdx having PstI-
and XbaI-compatible ends ligated into the PstI and XbaI sites of the plasmid modified
to contain the left and right flanking regions of the reductive domain to be excised.
The donor plasmids were recombined with recipient plasmid pCK7 to generate
pKOS011-13 (eryKR6 -> ARdx) and with recipient plasmid pCK13 to obtain
pKOS005-4 (eryKR2 -> ARdx). When transformed into S. coelicolor CH999,
plasmid pKOS011-13 produced the polyketides 30a,b, 31a,b, 77a,b and 78a,b in
Figure 6; plasmid pKOS005-4 produced the polyketide 2a,b.
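
The design of the duplex linker can be checked by confirming that the two oligonucleotides are complementary over their paired region and that the unpaired ends match PstI and XbaI overhangs. This is a minimal sketch under one stated assumption: the first four bases of the lower strand, which are not clearly legible in the sequence listing, are taken to be ACGT (the PstI-compatible overhang). The names `complement`, `TOP` and `BOTTOM_3TO5` are illustrative only.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement (no reversal), for strands written antiparallel."""
    return seq.translate(COMPLEMENT)


# Upper strand, written 5'->3', as given in the text.
TOP = "GCCGGACCGCACCACCCCTCGTGACGGAGAACCGGAGACGGAGAGCT"
# Lower strand, written 3'->5'; the leading "ACGT" is an assumption (see lead-in).
BOTTOM_3TO5 = "ACGTCGGCCTGGCGTGGTGGGGAGCACTGCCTCTTGGCCTCTGCCTCTCGAGATC"

left_overhang = BOTTOM_3TO5[:4]    # pairs with the TGCA-3' end left by PstI
right_overhang = BOTTOM_3TO5[-4:]  # pairs with the 5'-CTAG end left by XbaI
paired = BOTTOM_3TO5[4:-4]

print(paired == complement(TOP))      # True
print(left_overhang, right_overhang)  # ACGT GATC

# Top strand after ligation into PstI/XbaI-cut DNA: both sites are regenerated.
ligated_top = "CTGCA" + TOP + "CTAGA"
print(ligated_top.startswith("CTGCAG"), ligated_top.endswith("TCTAGA"))  # True True
```

Under that assumption the central 47 bases pair exactly, and ligation restores both a PstI and an XbaI site at the linker boundaries.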


Example 5 

Manipulation of Macrolide Ring Size by Directed Mutagenesis of DEBS 

The following are typical procedures. The products are indicated by their
numbers in Figure 6, where "a" represents the embodiment where R is methyl; "b"
represents the embodiment where R is hydrogen.

Using the expression system of Kao, C. M. et al. Science (1994) 265:509-512,
the expression of DEBS1 alone (1 + 2), in the absence of DEBS2 and DEBS3 (in
plasmid pCK9), resulted in the production of (2R,3S,4S,5R)-2,4-dimethyl-3,5-
dihydroxy-n-heptanoic acid δ-lactone ("the heptanoic acid δ-lactone" (1a) (see
Figures 6 and 7)) (1-3 mg/L), the expected triketide product of the first two modules
(Kao, C. M. et al. J Am Chem Soc (1994) 116:11612-11613). Thus, a thioesterase is
not essential for release of a triketide from the enzyme complex.

Two additional deletion mutant PKSs were constructed. The first contained
DEBS1 fused to the TE, and the second PKS included the first five DEBS modules
fused to the TE. Plasmids pCK12 and pCK15 respectively contained the genes
encoding the bimodular ("1+2+TE") and pentamodular ("1+2+3+4+5+TE") PKSs.

The 1+2+TE PKS in pCK12 contained a fusion of the carboxy-terminal end of
the acyl carrier protein of module 2 (ACP-2) to the carboxy-terminal end of the acyl
carrier protein of module 6 (ACP-6). Thus ACP-2 is essentially intact and is followed
by the amino acid sequence naturally found between ACP-6 and the TE. Plasmid
pCK12 contained eryA DNA originating from pS1 (Tuan, J. S. et al. Gene (1990)
90:21). pCK12 is identical to pCK7 (Kao et al. Science (1994), supra) except for a
deletion between the carboxy-terminal ends of ACP-2 and ACP-6. The fusion occurs
between residues L3455 of DEBS1 and Q2891 of DEBS3. An SpeI site is present
between these two residues so that the DNA sequence at the fusion is
CTCACTAGTCAG.

The 1+2+3+4+5+TE PKS in pCK15 contained a fusion 76 amino acids
downstream of the β-ketoreductase of module 5 (KR-5) and five amino acids
upstream of ACP-6. Thus, the fusion occurs towards the carboxy-terminal end of the
non-conserved region between KR-5 and ACP-5, and the recombinant module 5 was
essentially a hybrid between the wild type modules 5 and 6. Plasmid pCK15
contained eryA DNA originating from pS1 (Tuan et al. Gene (1990), supra). pCK15
is a derivative of pCK7 (Kao et al. Science (1994), supra) and was constructed using
the in vivo recombination strategy described earlier (Kao et al. Science (1994), supra).
pCK15 is identical to pCK7 with the exception of a deletion between KR-5 and
ACP-6, which occurs between residues G1372 and A2802 of DEBS3, and the
insertion of a blunted SalI fragment containing a kanamycin resistance gene (Oka, A.
et al. J Mol Biol (1981) 147:217) into the blunted HindIII site of pCK7. An arginine
residue is present between G1372 and A2802 so that the DNA sequence at the fusion
is GGCCGCGCC.
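
The two fusion sequences quoted above can be read codon by codon to confirm the residues described. The sketch below uses only the seven codons that occur in these two sequences, taken from the standard genetic code; the names `CODONS` and `translate` are illustrative and not part of the original procedure.

```python
# Subset of the standard genetic code covering only the codons present here.
CODONS = {
    "CTC": "L", "ACT": "T", "AGT": "S", "CAG": "Q",
    "GGC": "G", "CGC": "R", "GCC": "A",
}


def translate(dna):
    """Translate an in-frame DNA string using the codon subset above."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


PCK12_FUSION = "CTCACTAGTCAG"  # junction between L3455 (DEBS1) and Q2891 (DEBS3)
PCK15_FUSION = "GGCCGCGCC"     # junction between G1372 and A2802 (DEBS3)
SPEI = "ACTAGT"                # SpeI recognition sequence

print(translate(PCK12_FUSION))   # LTSQ
print(PCK12_FUSION.find(SPEI))   # 3
print(translate(PCK15_FUSION))   # GRA
```

In the pCK12 junction the SpeI site supplies the Thr-Ser pair between the leucine and glutamine codons; in the pCK15 junction the central CGC codon is the arginine placed between glycine and alanine.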

Plasmids pCK12 and pCK15 were introduced into S. coelicolor CH999 and
polyketide products were purified from the transformed strains according to methods
previously described (Kao et al. Science (1994), supra). The products obtained from
various transformants, CH999/pCK12 and CH999/pCK15 as well as CH999/pCK9
described above, are shown in Figure 7.

CH999/pCK12 produced the heptanoic acid δ-lactone (1a) (20 mg/L) as
determined by 1H and 13C NMR spectroscopy. This triketide product is identical to
that produced by CH999/pCK9, which expresses the unmodified DEBS1 protein
alone, described above. However, CH999/pCK12 produced 1a in significantly
greater quantities than did CH999/pCK9 (>10 mg/L vs. ~1 mg/L), indicating the
ability of the TE to catalyze thiolysis of a triketide chain attached to the ACP domain
of module 2. CH999/pCK12 also produced significant quantities of 1b, a novel
analog of 1a (10 mg/L), that resulted from the incorporation of an acetate start unit
instead of propionate. This is reminiscent of the ability of CH999/pCK7, which
expresses the intact PKS, to produce 8,8a-deoxyoleandolide (see Figure 1A) in
addition to 6dEB described above.

Since 1b was not detected in CH999/pCK9, its facile isolation from
CH999/pCK12 provides additional evidence for the increased turnover rate of DEBS1
due to the presence of the TE. In other words, the TE can effectively recognize an
intermediate bound to a "foreign" module that is four acyl units shorter than its natural
substrate, 6dEB. However, since the triketide products can probably cyclize
spontaneously into 1a and 1b under typical fermentation conditions (pH 7), it is not
possible to discriminate between a biosynthetic model involving enzyme-catalyzed
lactonization and one involving enzyme-catalyzed hydrolysis followed by
spontaneous lactonization. Thus, the ability of the 1+2+TE PKS to recognize the C-5
hydroxyl of a triketide as an incoming nucleophile is unclear.

CH999/pCK15 produced abundant quantities of (8R,9S)-8,9-dihydro-8-
methyl-9-hydroxy-10-deoxymethonolide (compound 13 in Figure 6) (10 mg/L),
demonstrating that the pentamodular PKS is active. Compound 13 was characterized
using 1H and 13C NMR spectroscopy of natural abundance and 13C-enriched material,
homonuclear correlation spectroscopy (COSY), heteronuclear correlation
spectroscopy (HETCOR), mass spectrometry, and molecular modeling. Compound
13 is an analog of 10-deoxymethonolide (compound 14, Lambalot, R.H. et al. J
Antibiotics (1992) 45:1981-1982), the aglycone of the macrolide antibiotic
methymycin. The production of 13 by a pentamodular enzyme demonstrates that
active site domains in modules 5 and 6 in DEBS can be joined without loss of
activity. Thus, it appears that individual modules as well as active sites are
independent entities which do not depend on association with neighboring modules to
be functional. The 12-membered lactone ring, formed by esterification of the terminal
carboxyl with the C-11 hydroxyl of the hexaketide product, indicated the ability of the
1+2+3+4+5+TE PKS, and possibly the TE itself, to catalyze lactonization of a
polyketide chain one acyl unit shorter than the natural product of DEBS, 6dEB.
Indeed, the formation of 13 may mimic the biosynthesis of the closely related
12-membered hexaketide macrolide, methymycin, which frequently occurs with the
homologous 14-membered heptaketide macrolides, picromycin and/or narbomycin
(Cane, D. E. et al. J Am Chem Soc (1993) 115:522-566). The erythromycin PKS
scaffold can thus be used to generate a wide range of macrolactones with shorter as
well as longer chain lengths.

The construction of the 1+2+3+4+5+TE PKS resulted in the biosynthesis of a
previously uncharacterized 12-membered macrolactone that closely resembles, but is
distinct from, the aglycone of a biologically active macrolide. The apparent structural
and functional independence of active site domains and modules as well as relaxed
lactonization specificity suggest the existence of many degrees of freedom for
manipulating these enzymes to produce new modular PKSs.

Example 6 

Production and Analysis of Polyketide Products
The expression vectors created by domain substitution in DEBS, as described
in Examples 1-5, were transformed into either Streptomyces coelicolor CH999 or
S. lividans K4-114 using standard techniques (D.A. Hopwood et al. (1985) "Genetic
Manipulation of Streptomyces: A Laboratory Manual," (The John Innes Foundation,
Norwich)). Both host strains have complete deletions of the native actinorhodin
polyketide synthase gene cluster and so produce no native polyketide products.
Transformants were grown on 150 mm R2YE agar plates for 2 days at 30°C, at which
time the agar slab was lifted from the dish and placed in a new dish which contained a
layer of 4 mm glass beads, 50 mL of liquid R2YE medium supplemented with 5 mM
sodium propionate, and ca. 1 g of XAD-16 resin beads. This was kept at 30°C for an
additional 7 days.

The XAD-16 resin was collected by vacuum filtration, washed with water,
then extracted twice with 10 mL portions of ethanol. The extracts were combined and
evaporated to a slurry, which was extracted with ethyl acetate. The ethyl acetate was
washed once with sat. NaHCO3 and evaporated to yield the crude product. Samples
were dissolved in ethanol and analyzed by LC/MS. The HPLC used a 4.6 x 150 mm
C18 reversed-phase column with a gradient from 80:19:1 H2O/CH3CN/CH3CO2H to
99:1 CH3CN/CH3CO2H. Mass spectra were recorded using a Perkin-Elmer/Sciex
API100LC spectrometer fitted with an APCI ion source. Each genetic construct
typically resulted in formation of products in pairs, indicated in Figure 6 and in
Table 2 by the letters "a" (R = CH3) and "b" (R = H), arising from priming of the
PKS by propionyl-CoA and acetyl-CoA, respectively.
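
Because the "a" and "b" members of each product pair differ only by one CH2 unit (propionate versus acetate starter), they are expected to appear in the LC/MS trace 14 mass units apart. The sketch below illustrates this for the parent pair; the molecular formulas used for 6dEB (C21H38O6) and 8,8a-deoxyoleandolide (C20H36O6) are supplied here as assumptions, not quoted from the text, and the function name `nominal_mass` is hypothetical.

```python
import re

# Nominal (integer) masses of the most abundant isotopes.
NOMINAL = {"C": 12, "H": 1, "N": 14, "O": 16, "S": 32}


def nominal_mass(formula):
    """Nominal mass of a simple formula such as 'C21H38O6' (no brackets)."""
    return sum(
        NOMINAL[element] * int(count or 1)
        for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula)
    )


FORMULA_A = "C21H38O6"  # 6dEB, R = CH3 (assumed formula)
FORMULA_B = "C20H36O6"  # 8,8a-deoxyoleandolide, R = H (assumed formula)

mh_a = nominal_mass(FORMULA_A) + 1  # [M+H]+ of the "a" product
mh_b = nominal_mass(FORMULA_B) + 1  # [M+H]+ of the "b" product

print(mh_a, mh_b, mh_a - mh_b)  # 387 373 14
```

The same 14-unit spacing can be used to pair up peaks for any construct in Table 2 when both starter units are incorporated.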
Additional Examples

Using the foregoing techniques, the DEBS constructs shown in Table 2 were
prepared. The products obtained when the constructs were transformed into
S. coelicolor CH999 are indicated by their numbers in Figure 6, where "a" represents
the embodiment where R is methyl; "b" represents the embodiment where R is
hydrogen. Some of the expression vectors were prepared by in vitro ligation; multiple
domain substitutions were created by subsequent in vitro ligations into the singly-
substituted expression plasmids. Others were obtained by in vivo recombination.



Table 2

Plasmid        Modules  Domain Substitution         Products (see Figure 6)

In Vitro Ligation

KOS011-28      2        eryAT1 -> rapAT2            4-nor-TKL (5a,b)
KOS008-51      2        eryAT2 -> rapAT2            2-nor-TKL (6a,b)
KOS014-62      2        eryKR2 -> rapDH/ER/KR1      3-deoxy-TKL (4a,b)
KAO410         3        eryKR2 -> rapDH/ER/KR1      KAO410 (10a,b)
                                                    3-deoxy-hemiketal (11a,b)
KAO392         3        eryKR2 -> rapKR2            KAO392 (3a,b)
KOS009-7       3        eryKR2 -> rapDH/KR4         KOS009-7 (12a,b)
KOS015-30      6        eryAT3 -> rapAT2            8-nor-6dEB (18a,b)
KOS016-47      6        eryAT5 -> rapAT2            4-nor-6dEB (19a,b)
KOS026-18b     6        eryKR5 -> rapDH/ER/KR1      5-deoxy-6dEB (26a,b)
KOS016-32      6        eryKR5 -> rapDH/KR4         4,5-dehydro-6dEB (27a,b)
KOS016-28      6        eryKR5 -> ARdx              5-oxo-6dEB (28a,b)
KOS015-63      6        eryAT6 -> rapAT2            2-nor-6dEB (20a,b)
KOS015-83      6        eryAT2 -> rapAT2 +          10-nor-10,11-dehydro-6dEB (32a,b)
                        eryKR2 -> rapDH/KR4
KOS015-84      6        eryAT2 -> rapAT2 +          10-nor-11-deoxy-6dEB (33a,b)
                        eryKR2 -> rapDH/ER/KR1
KOS016-100     6        eryAT5 -> rapAT2 +          4-nor-5-oxo-6dEB (38a,b)
                        eryKR5 -> ARdx
KOS015-106     6        eryAT6 -> rapAT2 +          2-nor-3-epi-6dEB (42a,b)
                        eryKR6 -> rapKR2
KOS015-109     6        eryAT6 -> rapAT2 +          2-nor-3-oxo-6dEB (31a,b)
                        eryKR6 -> ARdx
KOS011-90      6        eryAT2 -> rapAT2 +          4,5-dehydro-10-nor-6dEB (34a,b)
                        eryKR5 -> rapDH/KR4
KOS011-84      6        eryAT2 -> rapAT2 +          5-oxo-10-nor-6dEB (35a,b)
                        eryKR5 -> ARdx


KOS011-82      6        eryKR2 -> rapDH/KR4 +       4-nor-10,11-dehydro-6dEB (39a,b)
                        eryAT5 -> rapAT2
KOS011-85      6        eryKR2 -> rapDH/KR4 +       5-oxo-10,11-dehydro-6dEB (57a,b)
                        eryKR5 -> ARdx
KOS011-87      6        eryKR2 -> rapDH/KR4 +       4-nor-5-oxo-10,11-dehydro-6dEB (65a,b)
                        eryAT5 -> rapAT2 +
                        eryKR5 -> ARdx
KOS011-83      6        eryKR2 -> rapDH/ER/KR1 +    4-nor-11-deoxy-6dEB (40a,b)
                        eryAT5 -> rapAT2
KOS011-91      6        eryKR2 -> rapDH/ER/KR1 +    4,5-dehydro-11-deoxy-6dEB (55a,b)
                        eryKR5 -> rapDH/KR4
KOS011-86      6        eryKR2 -> rapDH/ER/KR1 +    5-oxo-11-deoxy-6dEB (56a,b)
                        eryKR5 -> ARdx
KOS011-88      6        eryKR2 -> rapDH/ER/KR1 +    4-nor-5-oxo-11-deoxy-6dEB (SSa,b)
                        eryAT5 -> rapAT2 +
                        eryKR5 -> ARdx
KOS015-40      6        eryAT2 -> rapAT2 +          2,3-dehydro-10-nor-6dEB (76a,b)
                        eryKR6 -> rapDH/KR4
KOS015-41      6        eryAT2 -> rapAT2 +          3-oxo-10-nor-6dEB (3Sa,b)
                        eryKR6 -> ARdx              10-nor-spiroketal (79a,b)
KOS015-44      6        eryKR2 -> rapDH/ER/KR1 +    2-nor-11-deoxy-6dEB (45a,b)
                        eryAT6 -> rapAT2
KOS015-45      6        eryKR2 -> rapDH/ER/KR1 +    2,3-dehydro-11-deoxy-6dEB (75a,b)
                        eryKR6 -> rapDH/KR4
KOS015-46      6        eryKR2 -> rapDH/ER/KR1 +    3-oxo-11-deoxy-6dEB (53a,b)
                        eryKR6 -> ARdx
KOS015-42      6        eryKR2 -> rapDH/KR4 +       2-nor-10,11-dehydro-6dEB (46a,b)
                        eryAT6 -> rapAT2
KOS015-43      6        eryKR2 -> rapDH/KR4 +       3-oxo-10,11-dehydro-6dEB (54a,b)
                        eryKR6 -> ARdx
KOS015-88      6        eryKR2 -> rapDH/KR4 +       3-epi-10,11-dehydro-6dEB (48a,b)
                        eryKR6 -> rapKR2
KOS015-89      6        eryKR2 -> rapDH/ER/KR1 +    3-epi-11-deoxy-6dEB (49a,b)
                        eryKR6 -> rapKR2
KOS015-87      6        eryAT2 -> rapAT2 +          3-oxo-10-nor-6dEB (36a,b)
                        eryKR6 -> rapKR2
KOS015-117     6        eryAT2 -> rapAT14 +         2,10-bisnor-6dEB (37a,b)
                        eryAT6 -> rapAT2
KOS015-120     6        eryAT2 -> rapAT14 +         2,10-bisnor-3-oxo-6dEB (58a,b)
                        eryAT6 -> rapAT2 +          2,10-bisnor-spiroketal (80a,b)
                        eryKR6 -> ARdx
KOS015-121     6        eryKR2 -> rapDH/KR4 +       2-nor-3-epi-10,11-dehydro-6dEB (62a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> rapKR2
KOS015-122     6        eryKR2 -> rapDH/KR4 +       2-nor-3-oxo-10,11-dehydro-6dEB (63a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> ARdx
KOS015-123     6        eryKR2 -> rapDH/ER/KR1 +    2-nor-3-epi-11-deoxy-6dEB (66a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> rapKR2
KOS015-125     6        eryKR2 -> rapDH/ER/KR1 +    2-nor-3-oxo-11-deoxy-6dEB (67a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> ARdx
KOS015-127     6        eryAT2 -> rapAT2 +          3-epi-10-nor-10,11-dehydro-6dEB (64a,b)
                        eryKR2 -> rapDH/KR4 +
                        eryKR6 -> rapKR2
KOS015-150     6        eryAT2 -> rapAT2 +          2,10-bisnor-10,11-dehydro-6dEB (59a,b)
                        eryKR2 -> rapDH/KR4 +
                        eryAT6 -> rapAT2
KOS015-158     6        eryAT2 -> rapAT2 +          3-oxo-10-nor-11-deoxy-6dEB (68a,b)
                        eryKR2 -> rapDH/ER/KR1 +
                        eryKR6 -> ARdx
[illegible]    6        eryAT2 -> rapAT2 +          2,10-bisnor-11-deoxy-6dEB (60a,b)
                        eryKR2 -> rapDH/ER/KR1 +
                        eryAT6 -> rapAT2
l\UoUiD-1ool\  6        eryKR5 -> rapDH/KR4 +       3-oxo-4,5-dehydro-6dEB (51a,b)
                        eryKR6 -> ARdx              3,5-dioxo-6dEB (52a,b)
INWOUJ 0-1 DUB 6        eryKR5 -> ARdx +            3-epi-5-oxo-6dEB (50a,b)
                        eryKR6 -> rapKR4
KOS016-183F    6        eryAT5 -> rapAT2 +          2,4-bisnor-6dEB (41a,b)
                        eryAT6 -> rapAT2
KOS016-183G    6        eryAT5 -> rapAT2 +          2,4-bisnor-3-epi-6dEB (61a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> rapKR2
KOS016-152E    6        eryKR5 -> rapDH/KR4 +       2-nor-4,5-dehydro-6dEB (43a,b)
                        eryAT6 -> rapAT2
KOS016-152F    6        eryKR5 -> rapDH/KR4 +       2-nor-3-epi-4,5-dehydro-6dEB (70a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> rapKR2
KOS016-152G    6        eryKR5 -> rapDH/KR4 +       2-nor-3-oxo-4,5-dehydro-6dEB (71a,b)
                        eryAT6 -> rapAT2 +          hemiketal (81a,b)
                        eryKR6 -> ARdx

KOS016-152K    6        eryKR5 -> ARdx +            2-nor-5-oxo-6dEB (44a,b)
                        eryAT6 -> rapAT2
KOS016-152I    6        eryKR5 -> ARdx +            2-nor-3-epi-5-oxo-6dEB (72a,b)
                        eryAT6 -> rapAT2 +
                        eryKR6 -> rapKR2
KOS015-34      6        eryAT3 -> rapAT2 +          2,8-bisnor-6dEB (47a,b)
                        eryAT6 -> rapAT2
KOS015-162     6        eryKR3 -> rapDH/ER/KR1 +    2-nor-6-oxo-11-deoxy-6dEB (73a,b)
                        eryKRS -> ARdx +
                        eryAT6 -> rapAT2

In Vivo Ligation

KOS005-4       3        KR2 -> ARdx                 3-keto-TKL (2a,b)
KOS011-62      6        AT2 -> rapAT2               10-nor-6dEB (17a,b)
KOS011-66      6        KR2 -> rapDH/ER/KR1         11-deoxy-6dEB (21a,b)
KOS011-64      6        KR2 -> rapDH/KR4            10,11-dehydro-6dEB (22a,b)
KOS011-19      6        DH/ER/KR4 -> rapDH/ER/KR1   6-epi-6dEB (23a,b)
KOS011-21      6        DH/ER/KR4 -> rapDH/KR4      6,7-dehydro-6dEB (24a,b)
KOS011-22      6        DH/ER/KR4 -> ARdx           7-oxo-6dEB (25a,b)
KOS011-74      6        KR6 -> rapKR2               3-epi-6dEB (29a,b)
KOS011-25      6        KR6 -> rapDH/KR4            2,3-dehydro-6dEB (74a,b)
KOS011-13      6        KR6 -> ARdx                 3-oxo-6dEB (30a,b)
                                                    2-nor-3-oxo-6dEB (31a,b)
                                                    spiroketal (77a,b)
                                                    2-nor-spiroketal (78a,b)
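
The product names in Table 2 follow a regular pattern that can be read off its legible rows: each AT replacement removes the methyl group at a fixed carbon ("nor"), and each KR replacement changes the oxidation state or configuration at a fixed carbon. The sketch below encodes that pattern as inferred from the table; the mapping `CARBON` and the function `modifier` are illustrative constructs, not part of the original disclosure, and the DH/ER/KR4 replacements are deliberately left out because they follow a different rule.

```python
# Ring carbon affected by each DEBS domain, as inferred from rows of Table 2
# (e.g. AT2 -> 10-nor, AT3 -> 8-nor, AT5 -> 4-nor, AT6 -> 2-nor).
CARBON = {
    "AT2": 10, "AT3": 8, "AT5": 4, "AT6": 2,
    "KR2": 11, "KR5": 5, "KR6": 3,
}


def modifier(domain, replacement):
    """Name fragment contributed by replacing one ery domain (inferred pattern)."""
    c = CARBON[domain]
    if domain.startswith("AT"):
        return f"{c}-nor"               # rapAT2 or rapAT14: methyl removed
    if replacement == "rapDH/ER/KR1":
        return f"{c}-deoxy"             # full reduction
    if replacement == "rapDH/KR4":
        return f"{c - 1},{c}-dehydro"   # reduction plus dehydration
    if replacement == "ARdx":
        return f"{c}-oxo"               # no reduction
    if replacement == "rapKR2":
        return f"{c}-epi"               # reduction with opposite configuration
    raise ValueError(f"no rule for {domain} -> {replacement}")


# KOS011-87: eryKR2 -> rapDH/KR4 + eryAT5 -> rapAT2 + eryKR5 -> ARdx
substitutions = [("AT5", "rapAT2"), ("KR5", "ARdx"), ("KR2", "rapDH/KR4")]
print([modifier(d, r) for d, r in substitutions])
# ['4-nor', '5-oxo', '10,11-dehydro']
```

For KOS011-87 the three fragments combine to the tabulated name 4-nor-5-oxo-10,11-dehydro-6dEB (65a,b).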



Example 7 

Preparation of 14,15-dehydro-6-deoxyerythronolide B (Compound 94 of Figure 9)
A 3 day culture of S. coelicolor CH999/pJRJ2 grown on three 100-mm R2YE agar
plates was overlaid with a solution of 10 mg of (2S,3R)-3-hydroxy-2-methyl-4-
pentenoic acid N-acetylcysteamine thioester dissolved in 2 mL of 9:1 water/DMSO
and allowed to dry. The culture was incubated at 30°C for an additional 4 days. The
agar was chopped and extracted twice with an equal volume of ethyl acetate. The
extracts were combined and evaporated. Purification by silica gel chromatography
(1:1 ethyl acetate/hexanes) yielded 0.75 mg of 14,15-dehydro-6-deoxyerythronolide
B, compound 94 in Figure 9. APCI-MS gives [M+H]+ = 385.



Analogous compounds with variations in R* and/or as represented by
compounds 96-107 and compound 113 of Figures 6G and 6H are prepared in a similar
manner as described in the previous paragraph but substituting the appropriate
diketide as the N-acetylcysteamine thioester. These compounds are prepared in this
manner and their structures verified.

The preparation of the appropriate derivatized diketides is described in
Example 17.

Example 8 

Synthesis of 1-(2-mercaptopyrimidinyl)-2-O-methoxycarbonyl-(D)-desosamine

For the glycosylation reactions in the following examples, the title compound
was used as a reagent. The conversions of paragraphs (A) and (B) of this Example are
shown in Figure 9.

(A) Preparation of 1,2-di-O-methoxycarbonyl-(D)-desosamine: To 1.00 g
of (D)-desosamine (4.74 mmol) in 50 mL CH2Cl2 was added 3.06 g of
diisopropylethylamine. The mixture was stirred at ambient temperature for 10 min,
then cooled to 4°C. Methyl chloroformate (1.34 g) was added dropwise at 4°C. The
reaction mixture was allowed to warm to ambient temperature and stirred overnight.
The solvent was evaporated to dryness, ethyl acetate (150 mL) was added to extract
the product, and the remaining solid was filtered. The ethyl acetate was removed
under vacuum and the crude product was purified on a silica gel column (ethyl
acetate:methanol:triethylamine 84:5:1 v/v/v) to give 1.29 g of product (88% yield).

(B) Preparation of 1-(2-mercaptopyrimidinyl)-2-O-methoxycarbonyl-(D)-
desosamine: A mixture of 1,2-di-O-methoxycarbonyl-(D)-desosamine (1.00 g, 3.436
mmol) and 0.7697 g of 2-mercaptopyrimidine (6.872 mmol) in a 25 mL 2-neck flask was
dried under vacuum for 45 minutes. Dichloroethane (10 mL), toluene (5 mL), and
DMF (5 mL) were added and stirred at ambient temperature followed by addition of
7 mL of SnCl4 (1 M in CH2Cl2). The reaction mixture was kept at 80°C overnight.
The reaction was terminated by addition of 1 N NaOH until the mixture turned basic.
The solution was extracted with 300 mL of ethyl acetate and the organic layer was
washed with saturated aqueous NaHCO3 (3 x 150 mL), dried over Na2SO4, filtered,
and evaporated. The product was purified on a silica gel column (1:1 ethyl
acetate:hexanes to ethyl acetate with 1% triethylamine) to obtain 0.25 g of 1-(2-
mercaptopyrimidinyl)-2-O-methoxycarbonyl-(D)-desosamine and 0.5 g of recovered
1,2-di-O-methoxycarbonyl-(D)-desosamine.

Example 9 

Preparation of 5-O-[1-β-(2-O-methoxycarbonyl-(D)-desosaminyl)]-6-
deoxyerythronolide B and 5-O-(1-β-(D)-desosaminyl)-6-deoxyerythronolide B
(Compounds 86 and 87 in Figure 8)

(A) A mixture of 6-deoxyerythronolide B (6-DEB) (15 mg, 39 µmol) and
1-(2-mercaptopyrimidinyl)-2-O-methoxycarbonyl-(D)-desosamine (65 mg, 200 µmol)
was dried under vacuum, then placed under a nitrogen atmosphere. To this was added
CH2Cl2 (1 mL), toluene (0.5 mL), and powdered 4 Å molecular sieves (50 mg), and
the mixture was stirred for 10 minutes at ambient temperature. Silver
trifluoromethanesulfonate (64 mg, 250 µmol) was added and the reaction was stirred
until LC/MS analysis indicated completion (18-20 hours). The mixture was filtered
through anhydrous Na2SO4 and evaporated to yield crude product. The residue was
dissolved in several drops of acetonitrile and loaded on a C-18 solid phase extraction
cartridge (Whatman). Unreacted desosamine was removed by washing with 20%
CH3CN/H2O and glycosylation products and the remaining macrolide aglycone were
recovered by eluting with 100% CH3CN. Final separation was carried out by HPLC
using a semiprep C-18 column (10 mm x 150 mm) (CH3CN/H2O, 20% isocratic over
5 min, then 20% to 80% over 30 min). HPLC fractions were checked by LC/MS and
fractions containing the same product were combined. The solvent was removed
under vacuum, yielding 8.4 mg of 5-O-[1-β-(2-O-methoxycarbonyl-(D)-
desosaminyl)]-6-deoxyerythronolide B (compound 86 in Figure 8) (36% yield).
APCI-MS gives [M+H]+ = 602.
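
The quantities in paragraph (A) can be cross-checked with a short calculation. This is a rough sketch under stated assumptions: the molar masses of 6-deoxyerythronolide B (386.5 g/mol, from C21H38O6) and silver trifluoromethanesulfonate (256.9 g/mol) are supplied here and are not quoted in the text, and the product mass of 601 is taken from the reported [M+H]+ of 602. The helper name `umol` is hypothetical.

```python
def umol(mass_mg, molar_mass):
    """Convert a mass in mg to micromoles, given a molar mass in g/mol."""
    return mass_mg / molar_mass * 1000


M_6DEB = 386.5     # assumed molar mass of 6-deoxyerythronolide B (C21H38O6)
M_AGOTF = 256.9    # assumed molar mass of silver trifluoromethanesulfonate
M_PRODUCT = 601.0  # compound 86, from the reported [M+H]+ = 602

aglycone_umol = umol(15, M_6DEB)     # text: 39 µmol
agotf_umol = umol(64, M_AGOTF)       # text: 250 µmol
product_umol = umol(8.4, M_PRODUCT)  # 8.4 mg isolated

yield_percent = 100 * product_umol / aglycone_umol
donor_equiv = 200 / aglycone_umol    # 200 µmol of thioglycoside donor

print(round(aglycone_umol), round(agotf_umol), round(yield_percent))
print(round(donor_equiv, 1))
```

With these inputs the calculation reproduces the stated 39 µmol of aglycone and 36% yield, and shows that the donor and silver salt were used in roughly five- to six-fold excess over the aglycone.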

(B) 5-O-[1-(2-O-methoxycarbonyl-(D)-desosaminyl)]-6-
deoxyerythronolide B (1-6 mg) from paragraph (A) was dissolved in 1 mL methanol,
0.2 mL H2O, and 0.2 mL triethylamine and kept at 70°C for 3 hours. Removal of the
solvent under vacuum gave crude product. This was dissolved in a few drops of
CH3CN and applied to a Whatman C18 solid phase extraction cartridge. The column
was washed with 25 mL of 20% CH3CN in water, then the product was eluted with
100% CH3CN. Evaporation of the solvent gave 5-O-(1-β-(D)-desosaminyl)-6-
deoxyerythronolide B (compound 87 in Figure 8) in quantitative yield. APCI-MS
gives [M+H]+ = 544.

Example 10 

Preparation of 5-O-[1-β-(2-O-methoxycarbonyl-(D)-desosaminyl)]-8,8a-
deoxyoleandolide and 5-O-(1-β-(D)-desosaminyl)-8,8a-deoxyoleandolide
(Compounds 88 and 89 in Figure 8)

(A) Treatment of 8,8a-deoxyoleandolide (12 mg) as described in Example
9(A) yielded 5-O-[1-β-(2-O-methoxycarbonyl-(D)-desosaminyl)]-8,8a-
deoxyoleandolide (60% yield) (compound 88 in Figure 8). APCI-MS gives [M+H]+
= 588.

(B) Treatment of 5-O-[1-β-(2-methoxycarbonyl-(D)-desosaminyl)]-8,8a-
deoxyoleandolide of paragraph (A) as described in Example 9(B) gave 5-O-(1-β-(D)-
desosaminyl)-8,8a-deoxyoleandolide (compound 89 in Figure 8) in quantitative yield.
APCI-MS gives [M+H]+ = 530.

Example 11

Preparation of 5-O-[1-β-(2-methoxycarbonyl-(D)-desosaminyl)]-3,6-dideoxy-3-
oxoerythronolide B (Compound 83 in Figure 8) and 5,11-bis-(O-[1-β-(2-
methoxycarbonyl-(D)-desosaminyl)])-3,6-dideoxy-3-oxoerythronolide B (Compound
92 in Figure 8)

Treatment of 3,6-dideoxy-3-oxoerythronolide B (6 mg) as described in
Example 9(A) gave 5-O-[1-β-(2-O-methoxycarbonyl-(D)-desosaminyl)]-3,6-dideoxy-
3-oxoerythronolide B (compound 83 in Figure 8) in 44% yield. APCI-MS gives
[M+H]+ = 600. A second product, 5,11-bis-(O-[1-β-(2-O-methoxycarbonyl-(D)-
desosaminyl)])-3,6-dideoxy-3-oxoerythronolide B (compound 92 in Figure 8), was
also isolated from this mixture in 26% yield; APCI-MS gives [M+H]+ = 815.

Example 12

Preparation of 5-O-(1-β-(D)-desosaminyl)-3,6-dideoxy-3-oxoerythronolide B
(Compound 91 in Figure 8) and of 5,11-bis-O-(1-β-(D)-desosaminyl)-3,6-dideoxy-3-
oxoerythronolide B (Compound 93 in Figure 8)
Treatment of 5-O-[1-β-(2-methoxycarbonyl-(D)-desosaminyl)]-3,6-dideoxy-3-
oxoerythronolide B of Example 11 as described in Example 9(B) gave
5-O-(1-β-(D)-desosaminyl)-3,6-dideoxy-3-oxoerythronolide B (compound 91 in
Figure 8) in quantitative yield. APCI-MS gives [M+H]+ = 542.

Treatment of 5,11-bis-(O-[1-β-(2-methoxycarbonyl-(D)-desosaminyl)])-3,6-
dideoxy-3-oxoerythronolide B of Example 11 as described in Example 9(B) gave
5,11-bis-O-(1-β-(D)-desosaminyl)-3,6-dideoxy-3-oxoerythronolide B (compound 93
in Figure 8) in quantitative yield. APCI-MS gives [M+H]+ = 699.

Example 13 

Preparation of 2'-O-methoxycarbonyl-(8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-
methylmethymycin (Compound 83 in Figure 8) and 3,9-bis-(O-[1-β-(2-
methoxycarbonyl-(D)-desosaminyl)])-(8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-
methylmethonolide (Compound 84 in Figure 8)
Treatment of (8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-methylmethymycin
(12 mg) according to the procedure of Example 9(A) yielded 2'-O-methoxycarbonyl-
(8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-methylmethymycin (compound 83 in
Figure 8) (34%); APCI-MS gave [M+H]+ = 544. A second product, 3,9-bis-(O-[1-β-
(2-O-methoxycarbonyl-(D)-desosaminyl)])-(8R,9S)-10-deoxy-8,9-dihydro-9-
hydroxy-8-methylmethonolide (compound 84 in Figure 8), was also isolated from this
mixture (33%); APCI-MS gave [M+H]+ = 759.

Example 14 

Preparation of (8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-methylmethymycin
(Compound 83 in Figure 8) and of (8R,9S)-10-deoxy-8,9-dihydro-9-(1-β-(D)-
desosaminyloxy)-8-methylmethymycin (Compound 85 in Figure 8)

Treatment of 2'-O-methoxycarbonyl-(8R,9S)-10-deoxy-8,9-dihydro-9-
hydroxy-8-methylmethymycin of Example 13 as described in Example 9(B) gave
(8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-methylmethymycin (compound 83 in
Figure 8) in quantitative yield. APCI-MS gives [M+H]+ = 486.

Treatment of 3,9-bis-(O-[1-β-(2-methoxycarbonyl-(D)-desosaminyl)])-
(8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-methylmethonolide of Example 13 as
described in Example 9(B) gave (8R,9S)-10-deoxy-8,9-dihydro-9-hydroxy-8-
methylmethymycin (compound 85 in Figure 8) in quantitative yield (elution from the
C18 solid-phase extraction cartridge was with 100% methanol). APCI-MS gives
[M+H]+ = 643.
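
The mass spectra reported in Examples 9 and 11 to 14 can be checked for internal consistency: removing one methoxycarbonyl protecting group replaces CO2CH3 with H, a net loss of C2H2O2 (nominal mass 58), so each protected/deprotected pair should differ by 58 per desosamine unit. The sketch below applies that check to the [M+H]+ values quoted above; the variable names are illustrative only.

```python
# Nominal mass lost per methoxycarbonyl group removed: C2H2O2 = 2*12 + 2*1 + 2*16.
MOC_SHIFT = 2 * 12 + 2 * 1 + 2 * 16

# (protected [M+H]+, deprotected [M+H]+, number of sugars), as reported in the text.
PAIRS = {
    "Example 9 (86 -> 87)": (602, 544, 1),
    "Examples 11/12, mono-glycoside": (600, 542, 1),
    "Examples 11/12, bis-glycoside": (815, 699, 2),
    "Examples 13/14, mono-glycoside": (544, 486, 1),
    "Examples 13/14, bis-glycoside": (759, 643, 2),
}

for label, (protected, deprotected, n_sugars) in PAIRS.items():
    consistent = protected - deprotected == MOC_SHIFT * n_sugars
    print(f"{label}: shift {protected - deprotected}, consistent={consistent}")
```

All five pairs show the expected shift of 58 per sugar, which supports the assignment of the deprotected products.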

Example 15 

Preparation of 14,15-dehydroerythromycin A (Compound 95 in Figure 9)
A sample of 14,15-dehydro-6-deoxyerythronolide B (0.75 mg) from Example
7 was dissolved in 0.6 mL of ethanol and diluted to 3 mL with sterile water. This
solution was used to overlay a 3 day old culture of Saccharopolyspora erythraea
WHM34 (eryA) grown on a 100 mm R2YE agar plate at 30°C. After drying, the plate
was incubated at 30°C for 4 days. The agar was chopped and extracted 3 times with
100 mL portions of 1% triethylamine in ethyl acetate. The extracts were combined
and evaporated. The crude product was purified by preparative HPLC (C18 reversed
phase, water-acetonitrile gradient containing 1% acetic acid). Fractions were
analyzed by mass spectrometry, and those containing pure 14,15-
dehydroerythromycin A (compound 95 in Figure 8) were pooled, neutralized with
triethylamine, and evaporated to a syrup. This was dissolved in water and extracted 3
times with equal volumes of ethyl acetate. The organic extracts were combined,
washed once with saturated aqueous NaHCO3, dried over Na2SO4, filtered, and
evaporated to yield 0.15 mg of product. APCI-MS gives [M+H]+ = 733.

Example 16 

Preparation of 14-oxo-8,8a-deoxyoleandolide (Compound 108) and 8,8a-
deoxyoleandolide-14-carboxylic acid (Compound 109) and Derivatives Thereof
These compounds can be prepared through ozonolysis of 14,15-dehydro-6-
deoxyerythronolide B (compound 94 of Figure 9).

A solution of compound 94 in methanol is cooled to -40°C, and ozone is
bubbled into the solution until formation of I2 is observed in a KI solution attached to
the outlet of the reaction vessel. Excess ozone is purged from the solution by
sparging with nitrogen gas, providing a solution of the ozonide of compound 94.
Treatment of this solution with Me2S will reduce the ozonide to the aldehyde,
compound 108.

Alternatively, the ozonide can be oxidized by addition of H2O2 to provide the 
corresponding carboxylic acid, compound 109. 

Methods for converting the aldehyde to amines via reductive amination (e.g.,
using an amine and NaBH3CN under mildly acidic conditions, or through formation
of an oxime followed by catalytic hydrogenation) are well known in the art. Similarly
well known are methods for converting the carboxylic acid into esters or amides such
as compound 110 (e.g., through activation using a carbodiimide reagent in the
presence of an alcohol or an amine). Diamines in either procedure are used to
produce dimeric macrolides such as compounds 111 and 112. (See Figure 6H.)

Example 17 

Diketide Thioester Synthesis: (2S,3R)-3-hydroxy-2-methyl-4-pentenoic acid
N-acetylcysteamine thioester
All diketide thioesters were synthesized by a common procedure. Illustrated
here is the synthesis of (2S,3R)-3-hydroxy-2-methyl-4-pentenoic acid N-
acetylcysteamine thioester. Enantioselective syn-aldol condensations were performed
according to the procedure of D.A. Evans et al., J Am Chem Soc (1992) 114:9434-
9453. Subsequent manipulations followed the general procedures of D.E. Cane et al.,
J Antibiotics (1995) 48:647-651.

The synthesis of [4S,3(2S,3R)]-4-benzyl-3-(3-hydroxy-2-methyl-4-
pentenoyl)-2-oxazolidinone by aldol condensation between (4S)-N-propionyl-4-
benzyl-2-oxazolidinone (1.17 g, 5.0 mmol) and acrolein (0.4 mL, 11 mmol) was
performed as described by D.A. Evans et al., J Am Chem Soc (1992) 114:9434-9453,
yielding 0.72 g of the adduct (50% yield) after chromatography on SiO2 (2:1
hexane/ethyl acetate).

The aldol adduct was treated with t-butyldimethylsilyl
trifluoromethanesulfonate (0.63 mL, 2.7 mmol) and 2,6-lutidine (0.35 mL, 3 mmol) in
THF at 0°C, yielding the O-silyl ether in quantitative yield after chromatography (4:1
hexane/ethyl acetate).
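The 50% yield quoted for the aldol step can be checked from the stated quantities. The sketch below is illustrative only: the molecular formulas (C13H15NO3 for the N-propionyl oxazolidinone and C16H19NO4 for the aldol adduct) are assumptions derived from the compound names rather than values given in the text, the oxazolidinone charge is read as 1.17 g, and standard average atomic weights are used.

```python
# Average atomic weights (g/mol) for the elements involved.
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def molar_mass(formula):
    """Molar mass in g/mol for a formula given as {element: count}."""
    return sum(ATOMIC_WEIGHT[element] * count for element, count in formula.items())


def percent_yield(product_mass_g, limiting_mmol, product_formula):
    """Isolated mass as a percentage of the theoretical mass."""
    theoretical_g = limiting_mmol / 1000.0 * molar_mass(product_formula)
    return 100.0 * product_mass_g / theoretical_g


# Assumed formulas, derived from the compound names (not stated in the text).
propionyl_oxazolidinone = {"C": 13, "H": 15, "N": 1, "O": 3}
aldol_adduct = {"C": 16, "H": 19, "N": 1, "O": 4}

# 1.17 g of the oxazolidinone should correspond to about 5.0 mmol.
oxazolidinone_mmol = 1.17 / molar_mass(propionyl_oxazolidinone) * 1000.0

# 0.72 g of adduct isolated from 5.0 mmol of limiting reagent.
aldol_yield = percent_yield(0.72, 5.0, aldol_adduct)

print(f"{oxazolidinone_mmol:.2f} mmol, {aldol_yield:.0f}% yield")
```

With these assumed formulas, both the 5.0 mmol charge and the reported 50% yield agree with the stated masses to within rounding.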
A solution of the O-silyl ether in 20 mL of THF was cooled on ice, and
2.8 mL of water was added followed by 0.61 mL of 50% H2O2. After 10 min, a
solution of 215 mg of LiOH·H2O in 2 mL of water was added. The reaction was
monitored by TLC, which revealed completion after 1 hour. A solution of 1.25 g of
sodium sulfite in 8 mL of water was added, and volatiles were removed by rotary
evaporation under reduced pressure. The resulting aqueous mixture was extracted
three times with 20 mL portions of CH2Cl2, then acidified to pH 2 using 6N HCl and
extracted 3 times with 50 mL portions of ethyl acetate. The ethyl acetate extracts
were combined, washed with brine, dried over Na2SO4, filtered, and evaporated to
provide the product acid as a colorless oil, 470 mg (70%).

The acid was dissolved in 10 mL of anhydrous dimethylformamide and cooled
on ice. After addition of diphenylphosphorylazide (1.25 mL) and triethylamine
(1.06 mL), the mixture was stirred for 2 hrs on ice.

N-acetylcysteamine (1.5 mL) was added, and the mixture was stirred
overnight at room temperature. After dilution with water, the mixture was extracted 3
times with ethyl acetate. The extracts were combined, washed with brine, dried over
Na2SO4, filtered, and evaporated to provide the crude O-silyl thioester.
Chromatography (1:1 hexane/ethyl acetate) provided pure product (460 mg, 70%).

The O-silyl thioester (400 mg) was dissolved in 25 mL of acetonitrile, and
5 mL of water was added followed by 2 mL of 48% HF. After 2 hours, an additional
2 mL of 48% HF was added. After a total of 3.5 hours, the reaction was stopped by
addition of saturated NaHCO3 to neutral pH. The product was extracted with 3 portions of
ethyl acetate, and the combined extracts were washed with brine, dried over Na2SO4,
filtered, and evaporated to provide the desilylated thioester. Chromatography (ethyl
acetate) gave 150 mg (56%) of pure (2S,3R)-3-hydroxy-2-methyl-4-pentenoic acid N-
acetylcysteamine thioester. APCI-MS: [M+H]+ = 232. 1H-NMR (CDCl3): δ 5.83,
1H, ddd (J=5.6, 10.8, 17.5); 5.33, 1H, ddd (J=1.6, 1.6, 16.9); 5.22, 1H, ddd
(J=1.5, 1.5, 10.8); 4.45, 1H, m; 3.45, 2H, m; 3.04, 2H, m; 2.82, 1H, dq (J=4.3, 6.8);
1.96, 3H, s; 1.22, 3H, d (J=6.8).
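The reported [M+H]+ of 232 for the final thioester can be reproduced from its elemental composition. The sketch below assumes the formula C10H17NO3S, obtained here by condensing 3-hydroxy-2-methyl-4-pentenoic acid (C6H10O3) with N-acetylcysteamine (C4H9NOS) and removing one water; that formula is an inference from the compound name, not a value given in the text. Monoisotopic masses are used.

```python
# Monoisotopic masses (Da) of the most abundant isotopes.
MONOISOTOPIC_MASS = {
    "C": 12.000000,
    "H": 1.007825,
    "N": 14.003074,
    "O": 15.994915,
    "S": 31.972071,
}
PROTON_MASS = 1.007276  # mass of H+ added in positive-ion mode


def protonated_mass(formula):
    """Monoisotopic m/z of the singly protonated molecule [M+H]+."""
    neutral = sum(MONOISOTOPIC_MASS[element] * count for element, count in formula.items())
    return neutral + PROTON_MASS


# Assumed formula of the diketide N-acetylcysteamine thioester (inferred from its name).
diketide_thioester = {"C": 10, "H": 17, "N": 1, "O": 3, "S": 1}

mh = protonated_mass(diketide_thioester)
print(f"[M+H]+ = {mh:.3f}")
```

Rounded to unit resolution, the calculated value matches the 232 reported above. The same function could be applied to the other diketide thioesters once their formulas are written out.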

Other diketide thioesters were prepared by substitution of appropriate 
aldehydes in place of acrolein. 

Example 18

Measurement of Antibacterial Activity
Antibacterial activity was determined either by disk diffusion assays with
Bacillus cereus as the test organism or by measurement of minimum inhibitory
concentrations (MIC) in liquid culture against sensitive and resistant strains of
Staphylococcus pneumoniae.

Claims

1. A method to prepare a nucleic acid with a nucleotide sequence
encoding a modified PKS from a nucleotide sequence encoding a naturally occurring
modular PKS wherein said naturally occurring modular PKS contains first regions
which encode enzymatic activities and second regions which encode scaffolding
amino acid sequences, which method comprises modifying at least one said first
region.

2. The method of claim 1 wherein said modifying comprises deleting or
inactivating at least one said first region; or

wherein said modifying comprises replacing at least one said first region with
a region encoding the corresponding enzymatic activity from a different naturally
occurring PKS gene or from a different region of the same naturally occurring PKS
gene.

3. The method of claim 1 or 2 wherein said nucleotide sequence encodes 
at least three PKS modules. 



4. The method of any of claims 1-3 wherein said modifying results in
utilization of a different extender unit; and/or

wherein said modifying results in utilization of a different starter unit; and/or

wherein said modification results in a polyketide of a different chain length.

5. A nucleic acid comprising a nucleotide sequence encoding a modified
PKS obtainable by the method of any of claims 1-4.

6. A cell culture modified to contain the nucleic acid of claim 5.

7. A method to prepare a polyketide which method comprises culturing
the cells of claim 6 under conditions wherein said polyketide is produced.

8. A novel polyketide prepared by the method of claim 7.

9. A method to prepare an antibiotic which method comprises
glycosylating the polyketide of claim 8.

10. An antibiotic prepared by the method of claim 9.

11. A method to construct a library of colonies containing expression 
vectors for a multiplicity of different polyketide synthases which method comprises 
transforming recombinant host cells with a mixture of expression vectors containing 
the nucleotide sequences obtained by the method of any of claims 1-4; and 

separating the transformed cells into individual colonies, and culturing the 
colonies. 

12. A method to prepare a polyketide combinatorial library which method
comprises culturing the library of colonies obtained by the method of claim 11 under
conditions wherein said polyketides are produced.

13. A multiplicity of cell colonies comprising a library of colonies wherein 
each colony of the library contains an expression vector comprising a nucleotide 
sequence encoding a modular PKS derived from a naturally occurring PKS gene 
cluster wherein at least one enzymatic activity has been deleted and/or replaced by a 
different version of said activity or is mutated so as to result in a polyketide other than 
that produced by said naturally occurring PKS and 

wherein the nucleotide sequence contained in each colony in the library 
encodes a different PKS. 

14. The multiplicity of cell colonies of claim 13 wherein in said library of
colonies said naturally occurring PKS gene cluster is the erythromycin gene cluster;
and/or

wherein, in at least one colony of said library, said different version is the
corresponding enzymatic activity from a different modular PKS or from another
location in the same PKS gene cluster; and/or

wherein the number of PKS modules contained in the expression vector is
different in at least two colonies of the library; and/or

wherein the extender unit utilized by the encoded PKS is different in at least
two colonies of said library; and/or

wherein the starter unit utilized by the encoded PKS is different in at least two
colonies of said library; and/or

wherein the reduction cycle specificities are different in at least two colonies 
of said library. 

15. A method to produce a library of modular PKS proteins which method
comprises culturing the multiplicity of cell colonies or the library of colonies of claim
13 or 14 under conditions wherein said expression vectors effect production of said
modular PKS proteins.

16. A library of PKS proteins prepared by the method of claim 15. 

17. A multiplicity of cell colonies comprising a library of colonies wherein 
each colony of the library contains a modular PKS derived from a naturally occurring 
PKS wherein at least one enzymatic activity has been deleted or replaced by a 
different version of said activity or is produced from a mutated form of said gene so 
as to result in a polyketide other than that produced by said naturally occurring PKS,
and 

each colony in the library contains a different PKS. 

18. The multiplicity of cell colonies of claim 17 wherein said naturally
occurring PKS is the erythromycin PKS; and/or

wherein the number of modules of PKS is different in at least two colonies of
the library; and/or

wherein the extender unit utilized by the PKS is different in at least two
colonies of the library; and/or

wherein the starter unit utilized by the PKS is different in at least two colonies 
of the library; and/or 

wherein the reduction cycle specificities are different in at least two colonies 
of said library. 

19. A method to produce a combinatorial library of polyketides which 
method comprises culturing the cell colonies or library of colonies of claim 17 or 18 
under conditions wherein polyketides whose synthesis is effected by said different 
PKS proteins are produced. 

20. A combinatorial library of polyketides prepared by the method of 
claim 19. 

21. A multiplicity of polyketides which comprises a combinatorial library
of polyketides which results from culturing colonies containing polyketide synthases 
derived from a naturally occurring PKS wherein at least one enzymatic activity has 
been deleted and/or replaced by a different version of said activity or is mutated so as 
to result in a polyketide other than that produced by said naturally occurring PKS, 
wherein each PKS in said library produces a different polyketide. 

22. The library of claim 21 wherein the chain length is different in at least
two polyketides; and/or

which contains at least two polyketides formed from different extender units;
and/or

which contains at least two polyketides of different oxidation states; and/or

which contains at least two polyketides of differing stereochemistry; and/or

which contains at least two polyketides formed from different starter units.

23. A method to identify a successful candidate polyketide which binds to
or reacts with a target moiety, which method comprises screening the library of claim
20, 21 or 22 by contacting each polyketide in said library with the target moiety under
conditions wherein a successful candidate would form a complex with said target
moiety, and

detecting any complex formed, thus identifying a polyketide of the library as
the successful candidate.



24. A compound of the formula: 




including the glycosylated and isolated stereoisomeric forms thereof; 
wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R* and R^ is independently H or alkyl (1-4C) wherein any alkyl at 
may optionally be substituted; 
X^ is H2, HOH or =O;
with the provisos that:

at least one of R^ and R^ must be alkyl (1-4C); and

the compound is other than compounds 1, 2, 3, 5 and 6 of Figure 6A; 

or of the formula: 




including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-8C;

each of R*, R^ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R^ may optionally be substituted;

each of and is independently H2, HOH or =O;

with the proviso that:

at least two of R^, R^ and R^ are alkyl (1-4C);
or of the formula: 



including the glycosylated and isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R\ R^ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R* may optionally be substituted;

each of X\ X^ and X^ is independently H2, HOH or =O;

with the provisos that:

at least one of R^ and R^ must be alkyl (1-4C); and

the compound is other than compound 8 of Figure 6A;

or of the formula:

[structural formula not reproduced]

including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-8C;

each of R\ and R^ is independently H or alkyl (1-4C) wherein any alkyl at
R' may optionally be substituted;

each of X* and is independently H2, HOH or =O;

with the proviso that:

at least one of R^ and R^ is alkyl (1-4C); and

the compound is other than compound 9 of Figure 6A;

or of the formula: 



including the glycosylated and isolated stereoisomeric forms thereof; 

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R^-R^ is independently H or alkyl (1-4C) wherein any alkyl at R^ may
optionally be substituted;

R5 is alkyl (1-5C);

each of X' and X^ and X* is independently H2, HOH or =O;

with the proviso that:

at least two of R^-R^ are alkyl (1-4C);

or of the formula:




including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R^-R^ is independently H or alkyl (1-4C) wherein any alkyl at R' may
optionally be substituted;

R^ is alkyl (1-5C);

X^ is OH or H;

each X\ X^ and is independently H2, HOH or =O; or X* is H and
the compound of formula (8) has a π-bond between positions 9-10, with the
proviso that:

if X^ is H, at least one of X^ and X^ is HOH or =O.

25. A compound of the formula: 




including the glycosylated and isolated stereoisomeric forms thereof;

wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R*, R^, R^, R"^ and R^ is independently H or alkyl (1-4C) wherein any
alkyl at R' may optionally be substituted;

each of X*, X^ and X* is independently H2, HOH or =O; or

X* or X^ or X^ or X^ is H and the compound of formula (5) contains a π-bond
at positions 8-9 or 6-7 or 4-5 or 2-3;

with the proviso that:

at least two of R'-R^ are alkyl (1-4C); and

the compound is other than compound 13 or 14 of Figure 6A or compound
205, 210-213 of Figure 11.

26. The compound of claim 25 wherein at least three of R*-R^ are alkyl (1-
4C); and/or

wherein X^ is -OH; and/or

X^ is =O; and/or

X^ is H.

27. A compound of the formula: 



including the glycosylated and isolated stereoisomeric forms thereof; 
wherein R* is a straight chain, branched or cyclic, saturated or unsaturated
substituted or unsubstituted hydrocarbyl of 1-15C;

each of R*-R* is independently H or alkyl (1-4C) wherein any alkyl at may
optionally be substituted;

each of X*-X^ is independently H2, HOH or =O; or

each of X*-X^ is independently H and the compound of formula (5) contains a
π-bond in the ring adjacent to the position of said X at 2-3, 4-5, 6-7, 8-9 and/or 10-11;

with the proviso that:

at least two of R^-R* are alkyl (1-4C); and

the compound is other than compounds 17, 24 or 28 of Figure 6B, compound
301-311 of Figure 12(A) or compound 312-322 of Figure 12(B).

28. The compound of claim 27 wherein at least three of R^-R^ are alkyl; 

and/or 

X^ is =O; and/or

X' is OH; and/or

X^ and X^ are OH; and/or

R* is substituted alkyl; and/or

R^ is substituted alkyl.

FURTHER INFORMATION CONTINUED FROM PCT/ISA/210

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-23

methods to prepare polyketide synthases libraries, nucleic
acids with modified polyketide synthases, cell colonies,
library of polyketide synthases proteins, methods of
identification of target moieties, polyketide and
antibiotics prepared by a method comprising modified
polyketide synthases nucleic acids.

2. Claim : 24 partially 

polyketide compound of formula (1) 

3. Claim : 24 partially 

polyketide compound of formula (2) 

4. Claim : 24 partially 

polyketide compound of formula (3) 

5. Claim : 24 partially 

polyketide compound of formula (4) 

6. Claim : 24 partially

polyketide compound of formula (7) 

7. Claim : 24 partially 

polyketide compound of formula (8) 

8. Claims: 25, 26

polyketide compound of formula (5)

9. Claims: 27, 28

polyketide compound of formula (6)